Experience

  1. Senior Research Engineer/Scientist Associate (Senior)

    University of Texas at Austin
    • Directed 10 research initiatives (4 independently, 6 collaboratively) resulting in 2 peer-reviewed publications with 2 under review and 6 in preparation
    • Lead 5-member team in developing, scaling and evaluating computational and ML methods in biological data
    • Trained genomic foundation models to evaluate their susceptibility to adversarial data poisoning attacks
    • Spearheaded novel compression tool in C++ and Python achieving 10-20% reduced file sizes and 50-70% faster compression times
    • Led benchmarking of generative genomic models against real genomes
  2. Research Associate & Research Assistant

    The Pennsylvania State University
    • Engineered novel algorithms in Python and Bash for k-mer analysis; 12 peer-reviewed publications (7 as first/corresponding author)
    • Developed kmerDB database consolidating DNA/protein sequences across Genbank and UniProt
    • Published Zseeker, an open-source Python tool for Z-DNA detection in large genomic datasets
    • Engineered ML pipelines for cancer detection from cfDNA and cfRNA based on liquid biopsies
    • Supervised 8+ researchers and mentored 5 junior researchers to their first lead-author publications
    • Organized workshop series around AI applications in biological research attended by 30+ participants
  3. Co-founder & Chief Technical Officer

    Neomer Diagnostics
    • Co-founded diagnostics startup translating patented nullomer research into clinical cancer detection platform
    • Developed ML pipeline in Bash, Julia, Python, and Slurm achieving AUC ranging from 0.89 to 0.94 in lung and ovarian cancers
    • Established regulatory roadmap for clinical validation and FDA approval
    • Secured $850K in translational research funding
  4. Software Engineering Consultant

    Independent Contractor
    • Optimized cancer genomics ML pipeline using Julia, achieving 50-fold speed increase, saving >3 months of computational time and $50K+ in costs
    • Created distributed computing pipelines for analysis of 100+TB of genomic data
    • Deployed and managed genomic analysis pipeline to Google Cloud Platform
    • Engineered compression procedure achieving 100-fold reduction, enabling development of kmerDB database

Education

  1. PhD in Bioinformatics and Genomics

    The Pennsylvania State University

    Doctoral thesis: Computational and Artificial Intelligence methods for large-scale omics data analysis

    GPA: 4.0/4.0

    Download CV
  2. M.Sc. in Artificial Intelligence

    KU Leuven

    Honours: Cum Laude

    Master’s thesis: Frequentmers – a k-mer based approach for the detection of liver cirrhosis gut microbiome mNGS

  3. B.Sc. in Mathematics

    Aristotle University of Thessaloniki
    Honours: Very Good
Skills & Hobbies
Technical Skills
Python

Pandas, NumPy, scikit-learn, PyTorch, Matplotlib

Machine Learning

AI evals, genomic Language Models, algorithm design and optimization

Bioinformatics

Large-scale genomic data analysis, k-mer methods, ML pipelines

Programming

Bash, Julia, C++, Java

Cloud & Tools

Google Cloud Platform, Git, Docker, Scientific Computing

Awards
Center for Molecular Carcinogenesis and Toxicology Symposium Abstract Award
Penn State College of Medicine ∙ Jan 2024
1st place award for outstanding research presentation
Alumni Society Award
Penn State College of Medicine ∙ Jan 2024
Awarded to top 5% of graduate students for outstanding research contributions
University Graduate Fellow
Penn State ∙ Jan 2023
Competitive fellowship awarded to top 1% of incoming graduate students
Huck Distinguished Fellow
Penn State Huck Institutes ∙ Jan 2023
Prestigious fellowship awarded to top 3% of graduate students in life sciences
Languages
100%
Greek Native
100%
English Fluent
100%
French Fluent
100%
German Fluent
40%
Japanese Elementary