Between Abundance and Austerity: Academic Work in Times of Data Mining
Jeanette Hofmann, WZB/HIIG
Crossing Borders – The Future of Access: International Conference at the German National Library, Frankfurt, 8th April 2014
Photo: Cushing Memorial Library and Archives, Texas A&M. CC BY 2.0.

Tale 1: Building a Text Corpus
• epol: joint project “post democracy and neoliberalism” (funded by the German ministry of research)
• Goal: discourse analysis of neoliberalism in public opinion
• Analysis of patterns of reasoning by means of text mining
• Data source: German newspaper articles from 1949 to 2011
Photo: Boston Public Library. CC BY-NC-ND 2.0.

Tale 2: Making Use of a Database
• “Politikfeld Internet” (Internet as a policy field)
• Goal: studying an emerging policy area
• Tracing public accounts of Internet-related problems, causal attributions, solutions
• Text mining with a focus on co-occurrence of terms, organizations, etc.

Tale 2: Making Use of a Database
LexisNexis terms and conditions (1)
• “Download […] for no more than 90 days, primarily for that Authorized User’s exclusive use, a single copy of insubstantial portions of those Materials […].”
• “[…] downloading and storing Materials in an archival database is prohibited.”
• “Use of the Online Services via mechanical, programmatic, robotic, scripted or any other automated means is strictly prohibited.”

Tale 2: Making Use of a Database
LexisNexis terms and conditions (2)
• Web front-end
• Max. 3,000 results per query
• Max. 500 documents per download

Copyright Law & Digital Humanities: Incompatible Worlds?!
• Copying protected works
  • Understanding intellectual content
  • Enjoying expressive quality of literature
• Text extraction, creating a database
  • Linguistic analysis of terms
  • Large-scale non-consumptive computational analysis

Consequences for Research
• Time-consuming negotiation of contracts
• Prohibitively expensive use licenses
• No long-term data archiving, waste of data corpora
• No open access to data sources
• No reproducibility/validation of research results
• Slowing down of methodological innovation
• Tensions between research administration and scholars!

Abundance meets Austerity
• Abundance
  • Innovative analytical tools
  • New research questions
  • Text and data sources
• Bottlenecks
  • Access to data
  • Rights of use
  • Rights to share data with peers

Conclusion: Generous Exceptions for Education and Research Needed
• Academia depends on access to text & data, freedom to share its results.
• Narrow exceptions are ineffective: compliance issues
• Fair use-like provisions for academic non-commercial research (licenses are not enough!)
• Recognition of large-scale, automated text/data mining as non-consumptive use
• Digital humanities cannot flourish if understood as utilization of protected works

Thank you!
Dr. Jeanette Hofmann
[email protected]
Wissenschaftszentrum Berlin für Sozialforschung
Reichpietschufer 50
D-10785 Berlin, Germany