AI & ML interests

Historical Media Analysis and Enrichment

Recent Activity

Interdisciplinary ML‑powered platform for exploring historical periodical media.

  • 📚 Corpus: Aggregates an unprecedented multilingual archive of newspapers and radio across time and borders.
  • 🎯 Vision: Enables a semantic-enriched workflow for representation, exploration, and historical research across modalities like print and audio.
  • 💡 Outputs:
    • Web App & Datalab platforms for exploratory analysis, search and programmatic access
    • NLP resources: Language identificatino, OCR quality assessment, Named Entity Recognition, Named Entity Linking, topic models
    • Historical insights under the theme of media influences.
  • 🧑‍🤝‍🧑 Hugging Face Organization hosts multilingual NER, NEL, OCR‑quality assessment models, and Spaces for named entity processing