Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Digital Humanities @ University of Pisa Alessandro Lenci CoLing Lab – Laboratorio di Linguistica Computazionale Università di Pisa Aix-Marseille Université 26 November 2015 Main research areas (not an exhaustive list…) • Natural Language Processing (NLP) • lexicons, term extraction, annotated corpora, NLP tools, etc. • NLP and cognitive science • NLP applied to DH • legal text Processing, computational dialectology, historical text processing, etc. • Digital philology • Greek and Latin processing • digital text and manuscript encoding and visualization • digital epigraphy • Big data analysis, social network analysis, etc. • 3D visualization and reconstruction applied to historical and • archeological research Development of online databases for the humanities • literature, linguistics, history, archeology, etc. Academic and research institutions involved in Digital Humanities in Pisa • Università di Pisa • Department of Philology, Literature and Linguistics » Computational Linguistics Lab (colinglab.fileli.unipi.it/) » Phonetics Lab (www.humnet.unipi.it/linguistica/lab_fonetica/index.ht ml) » Digital Culture Lab (labcd.humnet.unipi.it/) • Department of Informatics » Media Lab (medialab.di.unipi.it/wiki/Projects) • Department of Civilization » sections of Philosophy, History of Arts, History, Archeology Academic and research institutions involved in Digital Humanities in Pisa • CNR • Istituto di Linguistica Computazionale (ILC-CNR) » CLARIN coordinator for Italy • Istituto di Scienze e Tecnologie dell’Informazione (ISTICNR) » Visual Computing Lab (vcg.isti.cnr.it/) » Human Interfaces in Information Systems Laboratory Lab (giove.isti.cnr.it/) » Knowledge Discovery and Data Mining Lab (wwwkdd.isti.cnr.it/) • Istituto di Informatica e Telematica (IIT-CNR) • Scuola Superiore Sant’Anna • Perceptual Robotics Lab (www.percro.org/) CoLing Lab http://colinglab.fileli.unipi.it War Memories (Memorie di Guerra) http://www.memoriediguerra.it/wwm/ • An ongoing project to carry out a computational analysis and semantic indexing of Italian texts about WWI • University of Pisa, CoLing Lab • ILC-CNR, Pisa • history consultant: Prof. Nicola Labanca (University of Siena) • Texts are annotated automatically with state-of-the-art NLP tools to extract various kinds of information • simple and multi-word terms • named entities • events and their participants • georeferenced locations War Memories (Memorie di Guerra) http://www.memoriediguerra.it/wwm/ Natural Language Processing HLT kos terms thesauri ontologies Informatica Umanistica (Digital Humanities) http://www.fileli.unipi.it/infouma/ • Started in 2002, Informatica Umanistica includes a 3-years bachelor (Laurea) and a 2-years master (Laurea magistrale) • hosted by the Dept. of Philology, Literature and Linguistics, in collaboration with the Dept. of Informatics Informatica Umanistica bachelor program study plan • First year (60 credits) • informatics » Foundations of programming languages (12 credits) » Web design and programming (12 credits) • humanities » Cultural geography (6 credits) » Italian linguistics (9 credits) » Writing laboratory (6 credits) » English language (9 credits) » General linguistics (6 credits) Informatica Umanistica bachelor program study plan • Second year (60 credits) • informatics » Algorithmics (6 credits) » Databases and Web Laboratory (12 credits) • humanities » Introduction to historical studies (6 credits) » Italian contemporary literature (6 credits) » Computational linguistics (12 credits) » Italian Literature (12 credits) » History of arts (6 credits) Informatica Umanistica bachelor program study plan • Third year (60 credits) • informatics » Telematics (6 credits) » One course among (6 credits): » Digital libraries, Multimedia production, E-learning technologies, Graphic design • humanities » Text encoding (6 credits) » Latin language or literature (6 credits) » {English, French, Spanish, German} literature (6 credits) • Free choice courses (18 credits) • Stage (6 credits) • Final thesis (6 credits) Informatica Umanistica bachelor program • Professional profiles after the bachelor • content managment and Web development • language technology • electronic publishing houses • e-learning • journalism and communication Employment rates after the bachelor data collected in 2013 Informatica umanistica Humanities employment rate after 12 months since the degree employment rate after 12 months since the degree employed 62.5% employed 44.4% unemployed 37.5% unemployed 55.6% job coherence with the degree job coherence with the degree high 20% medium 25% medium 40% low 25% low 40% null 50% job satisfaction job satisfaction high 20% high 25% medium 60% medium 50% low 20% low 25% Informatica Umanistica master program • • • • • • Programming and data analysis (15 credits) Italian linguistics II (12 credits) (except for curr. D) 12 free choice credits 6 credits for the Seminar of Digital Culture 21 credits for the final thesis 54 specific credits for each curriculum: • • • • A. Electronic Publishing B. Graphics, Interactivity and Virtual Environments C. Knowledge Management D. Language Technology Informatica Umanistica Electronic Publishing • 18 credits among: • Social network analysis, Electronic publishing, Collaborative work platforms, Interface design and usability, Information retrieval, Digital philology • 12 credits among: • Cartography, Legal aspects of informatics, Digital history, History of printing and publishing • 24 credits among: • Communication, Editorial writing, Online journalism, Internet marketing, Italian contemporary literature, Methods of physics for the humanities, Sociology of cultural processes, Technologies for Web marketing, Theory of literature, Visual analytics Informatica Umanistica Graphics, Interactivity and Virtual Environments • 18 credits among: • Digital audio, Electronic publishing, 3D graphics for cultural heritage, Collaborative work platforms, Interface design and usability, Interface programming • 12 credits among: • Cartography, Legal aspects of informatics, Digital history, Theory of TV and multimedia arts • 24 credits among: • Virtual environments, Communication, Online journalism, Seminar on cinema, Geographical information systems, Sociology of cultural processes, Technologies for Web marketing, Technologies for e-learning, Visual analytics Informatica Umanistica Knowledge Management • 18 credits among: • Machine learning, Databases for decision support, Data mining, Data-driven decision methods, Artificial intelligence, Social network analysis, Information retrieval, Collaborative work platforms • 12 credits among: • Cartography, Legal aspects of informatics, Digital history, • 24 credits among: • Communication, Knowledge management, Internet marketing, Logic, Geographical information systems, Informaton technologies for literature production, Methods of physics for the humanities, Technologies for Web marketing, Technologies for e-learning, Visual analytics Informatica Umanistica Language technology • 18 credits among: • Machine learning, Data mining, Artificial intelligence, Social network analysis, Information retrieval, Natural language processing, Information retrieval • Italian linguistics (12 credits) is replaced with: • Computational linguistics II • General linguistics II • 12 credits among: • Cartography, Legal aspects of informatics, Digital history, • 24 credits among: • Applied linguistics, Phonetics and phonology, Philosophy of languages, Logic, Neurolinguistics, Methods of physics for the humanities Enrolled students bachelor 2009 2010 2011 2012 2013 2014 83 76 78 96 104 115 master 2009 2010 2011 2012 2013 2014 - 40 52 58 60 49 master students with a bachelor degree at Uni. Pisa 2009 2010 2011 2012 2013 2014 - 81.6% 73.1% 64.9% 56.9% 44.7% Thanks!