Prof. Dr. Jana Schor

(geb. Hertel)

Leitung Data Science in der Bioinformatik

Helmholtz Zentrum für Umweltforschung - UFZ
Department Integrative Bioinformatik
Permoserstr. 15
04318 Leipzig

Gebäude: Gebäude 6.0
Raum: Raum 516
Phone: +49 341 6025 4779
Email: jana.schor@ufz.de

Bio-Data Science Guppe

Meine Projekte

Curriculum Vitae

Aktuelle Position

Leitung Bio-Data Science Gruppe, Department Computational Biology & Chemistry

Professur

Bio-Data Science, Fakultät für Mathematik und Informatik, Universität Leipzig, Deutschland

Vergangene Position

Leitung Bioinformatik Gruppe, Department Integrative Bioinformatik

Wissenschaftliche Abschlüsse

Professur für Bio-Data Science (2024)
Dr. rer. nat. Informatik / Bioinformatik (2008)
Diplom Informatik (2005)

Forschung:

Meine Forschung entwickelt Bio-Data-Science- und KI-Methoden für die Umwelt- und Gesundheitsforschung weiter, mit einem besonderen Fokus auf transparente, glaubwürdige und praktisch nutzbare computergestützte Ansätze für komplexe wissenschaftliche Daten. Ich arbeite an der Schnittstelle von Datenintegration, maschinellem Lernen, graphbasierter KI und domänenspezifisch geerdeten großen Sprachmodellen, um neue Wege für die Analyse, Interpretation und Zugänglichmachung heterogener Daten in den Umwelt- und Lebenswissenschaften zu schaffen.

Ein zentrales Ziel meiner Arbeit ist es, fragmentierte und großskalige Daten in strukturierte, abfragbare und wissenschaftlich nutzbare Wissensbestände zu überführen. Dazu entwickle und nutze ich Methoden aus dem statistischen Lernen, dem maschinellen Lernen, dem Deep Learning und der Wissensrepräsentation, mit besonderem Schwerpunkt auf der Integration unterschiedlicher Datenquellen, Modalitäten sowie biologischer und umweltbezogener Organisationsebenen. Meine Forschung unterstützt sowohl die prädiktive Modellierung als auch die Generierung neuer Hypothesen, insbesondere in ökologischen, toxikologischen und gesundheitsbezogenen Kontexten.

Ein wichtiger Schwerpunkt meiner Arbeit liegt auf der Entwicklung vertrauenswürdiger KI. Deshalb lege ich großen Wert auf Erklärbarkeit, Unsicherheitsquantifizierung und reproduzierbare Forschungsworkflows, um sicherzustellen, dass computergestützte Ergebnisse transparent, robust und für Wissenschaft und Entscheidungsfindung nutzbar sind. In jüngerer Zeit arbeite ich zudem intensiv an agentischen KI-Systemen und domänenspezifischen Anwendungen großer Sprachmodelle, insbesondere dort, wo diese Modelle in strukturierte wissenschaftliche Wissensbestände eingebettet werden, um nachvollziehbare und zugängliche Schnittstellen zu komplexen Daten bereitzustellen.

Datenintegration, semantische Modellierung und Analyse mit Wissensgraphen und Graphdatenbanken
Graphenbasiertes maschinelles Lernen, einschließlich Graph Neural Networks für komplexe und vernetzte wissenschaftliche Daten
Erklärbare KI und Unsicherheitsquantifizierung für glaubwürdigere computergestützte Vorhersagen
Grounded LLMs und agentische KI für transparente, domänenspezifische Zugänge zu wissenschaftlichem Wissen
Reproduzierbare und skalierbare computergestützte Workflows für die Umwelt- und Lebenswissenschaften

Infrastruktur, Programme und Ansätze:

Hochleistungsrechencluster für die großskalige Datenverarbeitung.
KI-Training auf GPUs zur Beschleunigung der Modellleistung.
Graph- und andere neuronale Netzwerke für komplexe, vernetzte Datenstrukturen (für überwachte, unüberwachte und verstärkende Lernaufgaben).
Wissensgraphen und Graphdatenbanken zur Datenorganisation und semantischen Beziehungen.
Große Sprachmodelle zur Verbesserung der Interpretierbarkeit und Anwendungen in der Forschung.
Programmiersprachen wie R, Python, Shell-Scripting, Awk, Cypher und SQL für vielseitige Datenmanipulation und -analyse.

Lehre und Bildungsangebote:

Zusätzlich zu meiner Forschung widme ich mich der Ausbildung zukünftiger Datenwissenschaftler und Informatikstudierender. An der Universität Leipzig biete ich Kurse in statistischem Lernen, R-Programmierung und ein interaktives Data-Science-Curriculum an, das darauf abzielt, die Studierenden umfassend auf das Fachgebiet vorzubereiten. Diese Kurse umfassen:

Praktische Schulungen in R und Python,
Versionskontrolle mit Git,
Agile Projekt- und Selbstmanagement-Praktiken,
Storytelling mit Daten,
Erstellung überzeugender und repräsentativer Visualisierungen und
Entwicklung starker Präsentationsfähigkeiten.
Mein Ziel ist es, die Studierenden mit einem soliden, praxisorientierten Kompetenzset auszustatten, das sie auf eine erfolgreiche Karriere in realen Data-Science-Rollen vorbereitet.

We have been offered to participate in Building a Better World With Connected Data via the Graphs4Good initiative by neo4j

Helmholtz AI Associate

Host Supervisor of the HIDA Trainee Network

ResearcherID

ORCID

Publikationen

Meine fünf wichtigen rezenten Publikationen sind nach Relevanz sortiert:

Schor, J., Scheibe, P., Bernt, M., Busch, W., Lai, C., Hackermüller, J. (2022):
AI for predicting chemical-effect associations at the chemical universe level — deepFPlearn
Brief. Bioinform. 23 (5), bbac257 10.1093/bib/bbac257
Soulios, K., Scheibe, P., Bernt, M., Hackermüller, J., Schor, J. (2023):
deepFPlearn⁺: enhancing toxicity prediction across the chemical universe using graph neural networks
Bioinformatics 39 (12), btad713 10.1093/bioinformatics/btad713
Canzler, S., Schor, J., Busch, W., Schubert, K., Rolle-Kampczyk, U.E., Seitz, H., Kamp, H., von Bergen, M., Buesen, R., Hackermüller, J. (2020):
Prospects and challenges of multi‑omics data integration in toxicology
Arch. Toxicol. 94 (2), 371 - 388 10.1007/s00204-020-02656-y
Völkner, M., Wagner, F., Steinheuer, L.M., Carido, M., Kurth, T., Yazbeck, A., Schor, J., Wieneke, S., Ebner, L.J.A., Del Toro Runzer, C., Taborsky, D., Zoschke, K., Vogt, M., Canzler, S., Hermann, A., Khattak, S., Hackermüller, J., Karl, M.O. (2022):
HBEGF-TNF induce a complex outer retinal pathology with photoreceptor cell extrusion in human organoids
Nat. Commun. 13 , art. 6183 10.1038/s41467-022-33848-y
Gutsfeld, S., Wehmas, L., Omoyeni, I.E., Schweiger, N., Leuthold, D., Michaelis, P., Howey, X.M., Gaballah, S., Herold, N., Vogs, C., Wood, C., Bertotto, L., Wu, G.-M., Klüver, N., Busch, W., Scholz, S., Schor, J., Tal, T. (2024):
Investigation of peroxisome proliferator-activated receptor genes as requirements for visual startle response hyperactivity in larval zebrafish exposed to structurally similar per- and polyfluoroalkyl substances (PFAS)
Environ. Health Perspect. 132 (7), art. 077007 10.1289/EHP13667

Weiterführende Recherchen können Sie in unserem Publikationsverzeichnis durchführen.

2026 (2)

Canzler, S., Lehmann, J., Schor, J., Busch, W., Iacono, G., Hackermüller, J. (2026):
From toxicogenomics data to cumulative assessment groups: a framework for chemical grouping
Arch. Toxicol. 100 (1), 173 - 191
10.1007/s00204-025-04133-w
Spath, J., Raab, J., Schor, J., Scholz, S., Gutsfeld, S., Tal, T. (2026):
The zebrafish visual and acoustic motor response (VAMR) assay has the potential to add value to the developmental neurotoxicity in vitro battery (DNT IVB)
NeuroToxicology 114 , art. 103414
10.1016/j.neuro.2026.103414

zum Inhalt

2025 (12)

Canzler, S., Lehmann, J., Schor, J., Busch, W., Iacono, G., Hackermüller, J. (2025):
From toxicogenomics data to cumulative assessment groups: A framework for chemical grouping
Toxicol. Lett. 411 (Supplement), S55 - S56
10.1016/j.toxlet.2025.07.163
Gad, M., Tayyebi Sabet Khomami, N., Krieg, R., Schor, J., Philippe, A., Lechtenfeld, O.J. (2025):
Environmental drivers of dissolved organic matter composition across central European aquatic systems: A novel correlation-based machine learning and FT-ICR MS approach
Water Res. 273 , art. 123018
10.1016/j.watres.2024.123018
Mutlu, İ., Hackermüller, J., Schor, J. (2025):
Automated curation of spatial metadata in environmental monitoring data
Ecol. Inform. 86 , art. 103038
10.1016/j.ecoinf.2025.103038
Mutlu, İ., Schor, J., Hackermüller, J. (2025):
CleanGeoStreamR: Automated curation of spatially annotated environmental monitoring data
Version: v1 Zenodo
10.5281/zenodo.16880619
Scheibe, P., Schor, J. (2025):
AI-driven science communication: Leveraging LLMs and knowledge graphs for seamless knowledge exchange
bioRxiv
10.1101/2025.07.04.663152
Schor, J., Bohring, H., Scheibe, P. (2025):
EcoToxFred - Dialogues with a Knowledge Keeper
Zenodo
10.5281/zenodo.15696402
Schor, J., Mutlu, İ., Busch, W., Bumberger, J., Schulze, T., Ulrich, N., Hassan, K.N., Krauss, M., Brack, W., Doan, T., Bingert, S., Hackermüller, J. (2025):
Neo4j Container Link for the Curated Dataset in ChEOS Project [Data set]
Zenodo
10.5281/zenodo.14616124
Schor, J., Scheibe, P., Bernt, M., Hackermüller, J. (2025):
deepFPlearn - AI for predicting chemical-effect associations at the chemical universe level
Version: v1.2 Zenodo
10.5281/zenodo.13329413
Schor, J., Schulze, T., Ulrich, N., Mutlu, İ., Krauss, M., Brack, W., Doan, T., Bingert, S., Bumberger, J., Busch, W., Hackermüller, J. (2025):
Chemical mixture risk drivers and their heterogeneity in European freshwaters
Environ. Int. 205 , art. 109881
10.1016/j.envint.2025.109881
Schor, J., Schulze, T., Ulrich, N., Mutlu, İ., Krauss, M., Brack, W., Doan, T., Bingert, S., Bumberger, J., Busch, W., Hackermüller, J. (2025):
Chemical mixture risk drivers and their heterogeneity in European freshwaters - supplementary data - v1
Zenodo
10.5281/zenodo.15804639
Schor, J., Schulze, T., Ulrich, N., Mutlu, İ., Krauss, M., Brack, W., Doan, T., Bingert, S., Bumberger, J., Busch, W., Hackermüller, J. (2025):
Chemical mixture risk drivers and their heterogeneity in European freshwaters - supplementary data - v3
Zenodo
10.5281/zenodo.17249758
Schor, J., Schulze, T., Ulrich, N., Mutlu, İ., Krauss, M., Brack, W., Doan, T., Bingert, S., Bumberger, J., Busch, W., Hackermüller, J. (2025):
Chemical Mixture Risk Drivers in European Freshwaters: Analysis Software and Workflow
Version: v1 Zenodo
10.5281/zenodo.17909110

zum Inhalt

2024 (5)

Aldehoff, A.S., Karkossa, I., Goerdeler, C., Krieg, L., Schor, J., Engelmann, B., Wabitsch, M., Landgraf, K., Hackermüller, J., Körner, A., Rolle-Kampczyk, U., Schubert, K., von Bergen, M. (2024):
Unveiling the dynamics of acetylation and phosphorylation in SGBS and 3T3-L1 adipogenesis
iScience 27 (6), art. 109711
10.1016/j.isci.2024.109711
Gutsfeld, S., Wehmas, L., Omoyeni, I.E., Schweiger, N., Leuthold, D., Michaelis, P., Howey, X.M., Gaballah, S., Herold, N., Vogs, C., Wood, C., Bertotto, L., Wu, G.-M., Klüver, N., Busch, W., Scholz, S., Schor, J., Tal, T. (2024):
Investigation of peroxisome proliferator-activated receptor genes as requirements for visual startle response hyperactivity in larval zebrafish exposed to structurally similar per- and polyfluoroalkyl substances (PFAS)
Environ. Health Perspect. 132 (7), art. 077007
10.1289/EHP13667
Mutlu, İ., Schor, J. (2024):
Collection of chemical occurance data from NORMAN surface water database and other related files
Zenodo
10.5281/zenodo.11395194
Mutlu, İ., Schor, J. (2024):
Chemical occurence data curation results with CleanGeoStreamR
Zenodo
10.5281/zenodo.13799955
Schor, J. (2024):
deepFPlearn - datasets, models, and configurations
Zenodo
10.5281/zenodo.14409985

zum Inhalt

2023 (3)

Schor, J., Scheibe, P., Soulios, K., Bernt, M. (2023):
deepFPlearn +: enhancing toxicity prediction across the chemical universe using graph neural networks
Version: v2.0 Zenodo
10.5281/zenodo.13329443
Soulios, K., Scheibe, P., Bernt, M., Hackermüller, J., Schor, J. (2023):
deepFPlearn⁺: enhancing toxicity prediction across the chemical universe using graph neural networks
Bioinformatics 39 (12), btad713
10.1093/bioinformatics/btad713
Soulios, K., Scheibe, P., Bernt, M., Hackermüller, J., Schor, J. (2023):
deepFPlearn+
Zenodo
10.5281/zenodo.8146252

zum Inhalt

2022 (5)

Schor, J. (2022):
deepFPlearn - generic autoencoder model
Zenodo
10.5281/zenodo.14393931
Schor, J., Scheibe, P., Bernt, M., Busch, W., Lai, C., Hackermüller, J. (2022):
AI for predicting chemical-effect associations at the chemical universe level — deepFPlearn
Brief. Bioinform. 23 (5), bbac257
10.1093/bib/bbac257
Schor, J., Yazbeck, A.M, Völkner, M., Hackermüller, J., Karl, M.O. (2022):
HBEGF-TNF induces a complex retinal pathology with macular degeneration hallmarks in human organoids
Gene Expression Omnibus
Velandia-Huerto, C.A., Yazbeck, A.M., Schor, J., Stadler, P.F. (2022):
Evolution and phylogeny of microRNAs — Protocols, pitfalls, and problems
In: Allmer, J., Yousef, M. (eds.)
miRNomics. MicroRNA biology and computational analysis
Methods in Molecular Biology 2257
Springer Nature, p. 211 - 233
10.1007/978-1-0716-1170-8_11
Völkner, M., Wagner, F., Steinheuer, L.M., Carido, M., Kurth, T., Yazbeck, A., Schor, J., Wieneke, S., Ebner, L.J.A., Del Toro Runzer, C., Taborsky, D., Zoschke, K., Vogt, M., Canzler, S., Hermann, A., Khattak, S., Hackermüller, J., Karl, M.O. (2022):
HBEGF-TNF induce a complex outer retinal pathology with photoreceptor cell extrusion in human organoids
Nat. Commun. 13 , art. 6183
10.1038/s41467-022-33848-y

zum Inhalt

Inhalt:

2021 (2)
2020 (3)
2019 (3)
2018 (1)
2017 (2)
2016 (5)

Weiterführende Recherchen können Sie in unserem Publikationsverzeichnis durchführen.

2021 (2)

Schubert, K., Karkossa, I., Schor, J., Engelmann, B., Steinheuer, L.M., Bruns, T., Rolle-Kampczyk, U., Hackermüller, J., von Bergen, M. (2021):
A multi-omics analysis of mucosal-associated-invariant T cells reveals key drivers of distinct modes of activation
Front. Immunol. 12 , art. 616967
10.3389/fimmu.2021.616967
Völkner, M., Kurth, T., Schor, J., Ebner, L.J.A., Bardtke, L., Kavak, C., Hackermüller, J., Karl, M.O. (2021):
Mouse retinal organoid growth and maintenance in longer-term culture
Front. Cell. Dev. Biol. 9 , art. 645704
10.3389/fcell.2021.645704

zum Inhalt

2020 (3)

Balogh, G., Bernhart, S.H., Stadler, P.F., Schor, J. (2020):
A probabilistic version of Sankoff’s maximum parsimony algorithm
J. Bioinform. Comput. Biol. 18 (1), art. 2050004
10.1142/S0219720020500043
Canzler, S., Schor, J., Busch, W., Schubert, K., Rolle-Kampczyk, U.E., Seitz, H., Kamp, H., von Bergen, M., Buesen, R., Hackermüller, J. (2020):
Prospects and challenges of multi‑omics data integration in toxicology
Arch. Toxicol. 94 (2), 371 - 388
10.1007/s00204-020-02656-y
Olms, C., Schor, J., Yahiaoui-Doktor, M. (2020):
Potential co-factors of an intraoral contact allergy — A cross-sectional study
Dent. J. 8 (3), art. 83
10.3390/dj8030083

zum Inhalt

2019 (3)

Bauer, M., Hackermüller, J., Schor, J., Schreiber, S., Fink, B., Pierzchalski, A., Herberth, G. (2019):
Specific induction of the unique GPR15 expression in heterogeneous blood lymphocytes by tobacco smoking
Biomarkers 24 (3), 217 - 224
10.1080/1354750X.2018.1539769
Kämpf, C., Specht, M., Scholz, A., Puppel, S.-H., Doose, G., Reiche, K., Schor, J., Hackermüller, J. (2019):
uap: reproducible and robust HTS data analysis
BMC Bioinformatics 20 , art. 664
10.1186/s12859-019-3219-1
Schubert, K., Schor, J., Kratochvil, I., Hackermüller, J., von Bergen, M. (2019):
Multi-omics profiling of MAIT cells reveals specific patterns of antigen-dependent and -independent activation
Eur. J. Immunol. 49 (Suppl. 1), 141
10.1002/eji.201970300

zum Inhalt

2018 (1)

Canzler, S., Stadler, P.F., Schor, J. (2018):
The fungal snoRNAome
RNA 24 (3), 342 - 360
10.1261/rna.062778.117

zum Inhalt

2017 (2)

Canzler, S., Stadler, P.F., Hertel, J. (2017):
Evolution of fungal U3 snoRNAs: Structural variation and introns
Non-Coding RNA 3 (1), art. 3
10.3390/ncrna3010003
Yazbeck, A.M., Tout, K.R., Stadler, P.F., Hertel, J. (2017):
Towards a consistent, quantitative evaluation of microRNA evolution
J. Integr. Bioinform. 14 (1)
10.1515/jib-2016-0013

zum Inhalt

2016 (5)

Bhattacharya, D.P., Canzler, S., Kehr, S., Hertel, J., Grosse, I., Stadler, P.F. (2016):
Phylogenetic distribution of plant snoRNA families
BMC Genomics 17 , art. 969
10.1186/s12864-016-3301-2
Braasch, I., Gehrke, A.R., Smith, J.J., Kawasaki, K., Manousaki, T., Pasquier, J., Amores, A., Desvignes, T., Batzel, P., Catchen, J., Berlin, A.M., Campbell, M.S., Barrell, D., Martin, K.J., Mulley, J.F., Ravi, V., Lee, A.P., Nakamura, T., Chalopin, D., Fan, S., Wcisel, D., Cañestro, C., Sydes, J., Beaudry, F.E.G., Sun, Y., Hertel, J., Beam, M.J., Fasold, M., Ishiyama, M., Johnson, J., Kehr, S., Lara, M., Letaw, J.H., Litman, G.W., Litman, R.T., Mikami, M., Ota, T., Saha, N.R., Williams, L., Stadler, P.F., Wang, H., Taylor, J.S., Fontenot, Q., Ferrara, A., Searle, S.M.J., Aken, B., Yandell, M., Schneider, I., Yoder, J.A., Volff, J.-N., Meyer, A., Amemiya, C.T., Venkatesh, B., Holland, P.W.H., Guiguen, Y., Bobe, J., Shubin, N.H., Di Palma, F., Alföldi, J., Lindblad-Toh, K., Postlethwait, J.H. (2016):
The spotted gar genome illuminates vertebrate evolution and facilitates human-teleost comparisons
Nature Genet. 48 (4), 427 - 437
10.1038/ng.3526
Canzler, S., Stadler, P.F., Hertel, J. (2016):
U6 snRNA intron insertion occurred multiple times during fungi evolution
RNA Biol. 13 (2), 119 - 127
10.1080/15476286.2015.1132139
de Araujo Oliveira, J.V., Costa, F., Backofen, R., Stadler, P.F., Walter, M.E.M.T., Hertel, J. (2016):
SnoReport 2.0: new features and a refined Support Vector Machine to improve snoRNA identification
BMC Bioinformatics 17 (Suppl 18), art. 464
10.1186/s12859-016-1345-6
Fontenot, Q., Ferrara, A., Searle, S.M.J., Aken, B., Yandell, M., Schneider, I., Yoder, J.A., Volff, J.-N., Meyer, A., Amemiya, C.T., Venkatesh, B., Holland, P.W.H., Guiguen, Y., Bobe, J., Shubin, N.H., Di Palma, F., Alföldi, J., Lindblad-Toh, K., Postlethwait, J.H., Martin, K.J., Mulley, J.F., Ravi, V., Lee, A.P., Nakamura, T., Chalopin, D., Fan, S., Wcisel, D., Cañestro, C., Sydes, J., Beaudry, F.E.G., Sun, Y., Hertel, J., Beam, M.J., Fasold, M., Ishiyama, M., Johnson, J., Kehr, S., Lara, M., Letaw, J.H., Litman, G.W., Litman, R.T., Mikami, M., Ota, T., Saha, N.R., Williams, L., Stadler, P.F., Wang, H., Taylor, J.S., Braasch, I., Gehrke, A.R., Smith, J.J., Kawasaki, K., Manousaki, T., Pasquier, J., Amores, A., Desvignes, T., Batzel, P., Catchen, J., Berlin, A.M., Campbell, M.S., Barrell, D. (2016):
Corrigendum: The spotted gar genome illuminates vertebrate evolution and facilitates human-teleost comparisons (vol 48, pg 427, 2016)
Nature Genet. 48 (6), 700 - 700
10.1038/ng0616-700c

zum Inhalt

Für ältere Publikationen entstanden an der Uni Leipzig und/oder Uni Wien unter meinem Mädchennamen Jana Hertel

Professur für Bioinformatik
Institut für Informatik
Universität Leipzig
Härtelstr. 16-18
D-04107 Leipzig

Institut für Theoretische Chemie
Universität Wien
Währinger Straße 17
A-1090 Wien

schauen Sie bitte in meinem ORCID Profil.