Postdoctoral Researcher for AI Data Provenance, Quality and Maintenance (EU Project MINERVA, EU Project ELLIOT, EU Project Open Euro LLM) (E 13 TV-L, 100%, m/f/d)
The Tübingen AI Center aims to foster a world-class research ecosystem in the field of Machine Learning and Artificial Intelligence. It is one of five competence centers funded by Germany's Ministry of Education and Research and collaborates closely with the first institute of the European Laboratory for Learning and Intelligent Systems (ELLIS). It is part of the major Cyber Valley initiative, where many partners in academia and industry have joined forces to work on breakthroughs in artificial intelligence. The Tübingen AI Center is a joint institute between the University of Tübingen and the Max Planck Institute for Intelligent Systems, which are top academic institutions in artificial intelligence. An important part of the Tübingen AI Center is the software engineering team that supports the research ecosystem in website platforms, computing infrastructure and research projects.
The Tübingen AI Center is looking for a postdoctoral researcher to support the European efforts to train large-scale foundational models from the perspective of data quality and sourcing. In this position we are looking for a capable machine learning scientist with a background in data curation to
- Work with European data owners, such as libraries, to discuss transparent inclusions of their corpora to strengthen European perspectives in new AI models.
- Devise mechanisms that clearly record data provenance at scale and provide opt-out options appropriate for European law.
- Research new strategies to use machine learning tooling to improve data quality, extraction and tracing, for example through large-scale deployment of better OCR, or machine-generated Q&A.
Data quality is a crucial ingredient of modern machine learning models, and we are happy to support all efforts that raise the waterline of openly available data quality.
Qualifications:
- PhD in Computer Science or a related technical field.
- A strong interest in computer science and artificial intelligence.
- Curiosity driven person with interest in research for example demonstrated by a first publication.
- Experience in machine learning, computer vision, or a related area.
- Excellent programming skills in Python.
- Good mathematical foundations in linear algebra, probability, and statistics.
Data generated and maintained by these efforts are a critical stepping stone toward the next generation of open-source models, whose properties will be irrevocably shaped by the data they are trained on. This position is part of the European Minerva project that connects different research partners and major HPC providers.
Show us what you can do by providing links to your portfolio examples, GitHub or online source code repository.
What we offer
The position is going to start as soon as filled. Our team is passionate about AI and consists of people from all around the world. This position is a great opportunity to gain or advance skills in machine learning. We have a flexible structure and you are encouraged to also push your own ideas and projects.
Application and Deadline
The University of Tübingen is committed to equal opportunities and diversity. Individuals with disabilities who are equally qualified will be given preference in the hiring process. In line with its goal to increase the proportion of women in research, the university strongly encourages qualified women to apply. The position is available for job sharing. Employment will be managed by the central administration of the University of Tübingen. Please submit your complete application documents (including cover letter, CV, and relevant certificates) as a single PDF file via email to applications@tuebingen․ai by August 31, 2025.