🔐 Athens Diachronic Corpus

This is a private development site

Incorrect password. Please try again.
ATHENS CORPUS

Athens Diachronic Corpus

Exploring 3,000 Years of Greek Language Evolution Through AI

About the Project

ΕΛΙΔΕΚ-funded cutting-edge research in computational diachronic linguistics

🤖

AI-Powered Analysis

Leveraging state-of-the-art NLP and machine learning to trace semantic shifts, syntactic evolution, and morphological changes across millennia of Greek texts.

📚

Massive Corpus

Over 10 million tokens spanning from Linear B tablets to contemporary Greek social media, creating the most comprehensive diachronic Greek dataset.

🌐

Open Science

All data, tools, and findings freely available under CC-BY license. Interactive web interface for researchers worldwide to query and analyze our corpus.

Research Team

Interdisciplinary experts in linguistics, computer science, and digital humanities

👨‍🏫

Dr. [Principal Investigator]

Project Director

Computational linguist specializing in Greek historical morphology and semantic change detection algorithms.

👩‍🔬

Dr. [Postdoc Name]

Senior Researcher

Expert in Byzantine Greek and neural language models for historical text analysis.

👨‍💻

[PhD Student]

NLP Engineer

Developing transformer-based models for diachronic word embeddings and semantic drift visualization.

👩‍🎓

[Research Assistant]

Data Scientist

Corpus annotation, quality control, and development of the web-based query interface.

Project Timeline

Key milestones and deliverables

Project Launch & Team Assembly

Recruitment of research team, infrastructure setup, initial corpus design and annotation guidelines.

2024 Q1

Corpus Collection Phase I

Digitization of ancient and medieval texts, OCR optimization for polytonic Greek, initial quality control.

2024 Q3

AI Model Development

Training diachronic word embeddings, developing semantic change detection algorithms, first research papers.

2025 Q2

Public Interface Launch

Release of web-based corpus query system, API for computational access, first workshops for researchers.

2026 Q1

Project Completion

Final corpus release, comprehensive documentation, sustainability plan for long-term maintenance.

2027 Q4

Resources & Tools

Access our corpus, tools, and documentation

🔍

Corpus Search

Advanced query interface with linguistic annotations, temporal filters, and statistical analysis tools.

Coming Soon

API Access

RESTful API for programmatic access to the corpus, with Python and R client libraries.

Documentation
📊

Visualization Tools

Interactive visualizations of semantic change, word frequency evolution, and syntactic patterns.

Explore

Contact Us

Interested in collaboration or have questions about the project?