This is the main course website for the lecture Data Modeling and Knowledge Generation in winter term 2021/22 at University of Bayreuth.
- 30. November 2021: Jump to the schedule to download the latest slide deck!
- 26. November 2021: Third practical is online. Today we will be working with git!
- 23. November 2021: Lecture number six is online!
- 16. November 2021: Lecture number five is about databases.
- 12. November 2021: Second practical done. That’s the first simple data inspection and cleaning!
- 09. November 2021: Lecture number four is online!
- 02. November 2021: Slides for third lecture are online.
- 29. October 2021: First practical finished! Yay! Download slides and material from the schedule!
- 26. October 2021: Slides for second lecture are online.
- 19. October 2021: Lecture starts and introductory slides are online! See the schedule to download the slides.
Recordings of the lecture are available online but require an account of UBT.
Data models represent the real world in the analysis process, they act as their placeholder, so to speak. As such, they create their own reality for the analyses. The formulation of data models is always subject to conscious and unconscious selection and transformation decisions. These decisions implicitly influence the way algorithms and analysts understand and process the real world. At the same time, data models act as blueprints for a real world that comes after analysis. Finally, analysis results are produced and evaluated with the help of data models and communicated as new knowledge. The decisions mentioned above therefore have far-reaching implications for the expected results and the knowledge that can be gained from these results. This dual role of description and prescription opens up a field of tension for the analysis process in interdisciplinary research as well as in numerous business areas that make use of "data driven decision making", for example. Only when the data model, algorithm and results are viewed as a holistic unit of an analysis process can reliable knowledge be gained from data.
In this course, different methods for data analysis and knowledge generation will be presented - including methods from the fields of machine learning, data mining, text mining, social network analysis and information visualization. These methods, which are currently widely used in science, business and beyond, bring about different requirements for the modelling of data. These requirements are viewed critically. The implications for the expected results and the knowledge derived from them are explicitly stated.
Students will learn different methods for data analysis and knowledge generation - including methods from the fields of machine learning, data mining, text mining, social network analysis and information visualization. Students become aware of the requirements for the required data models that the different analysis methods entail. Students know how to critically question data analyses, how to specify the implicit modelling decisions and how to evaluate analysis results always against the background of these decisions.
|Session was a lecture|
|Session was a practical|
|Find slides here|
|Find code material here|
|Find external material here|
|This session has a self assessment quiz attached|
- Markus Putnings, Heike Neuroth, and Janna Neumann, editors. Praxishandbuch Forschungsdatenmanagement. De Gruyter Saur, 2021. ISBN 9783110657807. doi:doi:10.1515/9783110657807.
- Kieran Healy. The Plain Person’s Guide to Plain Text Social Science. The Internet, 2019. https://plain-text.co/.
- Julia Flanders and Fotis Jannidis. The shape of data in digital humanities: modeling texts and text-based resources. Routledge, 2018. doi:10.4324/9781315552941.
- Fotis Jannidis, Hubertus Kohle, and Malte Rehbein. Digital Humanities: Eine Einführung. Springer, 2017. doi:10.1007/978-3-476-05446-3.
- Jo Bates, Yu-Wei Lin, and Paula Goodale. Data journeys: capturing the socio-material constitution of data objects and flows. Big Data & Society, 3(2):1–12, 2016. doi:10.1177/2053951716654502.
- Alfons Kemper and Andre Eickler. Datenbanksysteme: Eine Einführung. Oldenbourg Verlag, München, 2015. ISBN 978-3-11-044375-2.
- Susan Schreibman, Ray Siemens, and John Unsworth. A companion to digital humanities. John Wiley & Sons, 2008. http://www.digitalhumanities.org/companion/.
- Willard McCarty. Humanities Computing. Palgrave Macmillan UK, 2005. ISBN 978-1-4039-3504-5.
- Hans-Jörg Bullinger, Kai Wörner, and Juan Prieto. Wissensmanagement — modelle und strategien für die praxis. In: Wissensmanagement: Schritte zum intelligenten Unternehmen, pages 21–39. Springer Berlin Heidelberg, Berlin, Heidelberg, 1998. doi:10.1007/978-3-642-71995-0_2.