<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Projects | Ioannis Mouratidis</title><link>https://ioannis-mouratidis.github.io/projects/</link><atom:link href="https://ioannis-mouratidis.github.io/projects/index.xml" rel="self" type="application/rss+xml"/><description>Projects</description><generator>Hugo Blox Builder (https://hugoblox.com)</generator><language>en-us</language><lastBuildDate>Sun, 19 May 2024 00:00:00 +0000</lastBuildDate><image><url>https://ioannis-mouratidis.github.io/media/icon_hu_899445b689d8f445.png</url><title>Projects</title><link>https://ioannis-mouratidis.github.io/projects/</link></image><item><title>Genomic Data Compression Tool</title><link>https://ioannis-mouratidis.github.io/projects/compression-tool/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://ioannis-mouratidis.github.io/projects/compression-tool/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;A novel compression tool developed in C++ and Python specifically optimized for multiple genomic file formats. This tool significantly reduces storage requirements while dramatically improving compression speed compared to existing solutions.&lt;/p&gt;
&lt;h2 id="performance"&gt;Performance&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;10-20% smaller file sizes&lt;/strong&gt; compared to standard genomic compression tools&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;50-70% faster compression times&lt;/strong&gt; enabling real-time analysis&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Multiple format support&lt;/strong&gt;: Handles various genomic data formats&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Lossless compression&lt;/strong&gt;: Maintains data integrity for scientific applications&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="technical-approach"&gt;Technical Approach&lt;/h2&gt;
&lt;p&gt;The tool leverages domain-specific knowledge about genomic data structure to achieve superior compression ratios and speeds. Implementation in C++ provides low-level performance optimization while Python bindings enable easy integration into bioinformatics pipelines.&lt;/p&gt;
&lt;h2 id="impact"&gt;Impact&lt;/h2&gt;
&lt;p&gt;This compression tool enables:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Reduced storage costs for large-scale genomic projects&lt;/li&gt;
&lt;li&gt;Faster data transfer and backup operations&lt;/li&gt;
&lt;li&gt;Real-time compression for sequencing pipelines&lt;/li&gt;
&lt;li&gt;More efficient cloud-based genomic analysis&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="applications"&gt;Applications&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Large-scale sequencing projects&lt;/li&gt;
&lt;li&gt;Genomic data archiving&lt;/li&gt;
&lt;li&gt;Cloud-based bioinformatics platforms&lt;/li&gt;
&lt;li&gt;Real-time sequencing data processing&lt;/li&gt;
&lt;/ul&gt;</description></item><item><title>Zseeker</title><link>https://ioannis-mouratidis.github.io/projects/zseeker/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://ioannis-mouratidis.github.io/projects/zseeker/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;Zseeker is an open-source Python tool designed for optimized detection of Z-DNA forming sequences in large genomic datasets. Z-DNA is an alternative left-handed DNA structure implicated in gene regulation, genome instability, and various biological processes.&lt;/p&gt;
&lt;h2 id="features"&gt;Features&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;High Performance&lt;/strong&gt;: Optimized algorithms enable analysis of entire genomes&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Accuracy&lt;/strong&gt;: Improved detection accuracy compared to previous methods&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Scalability&lt;/strong&gt;: Designed to handle large-scale genomic datasets&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Easy to Use&lt;/strong&gt;: Simple Python interface with clear documentation&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Open Source&lt;/strong&gt;: Freely available for academic and commercial use&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="technical-details"&gt;Technical Details&lt;/h2&gt;
&lt;p&gt;Zseeker implements advanced algorithms for identifying sequences capable of forming Z-DNA structures based on sequence composition and thermodynamic properties. The tool is optimized for both speed and accuracy, making it suitable for genome-wide analyses.&lt;/p&gt;
&lt;h2 id="use-cases"&gt;Use Cases&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Genome-wide Z-DNA mapping&lt;/li&gt;
&lt;li&gt;Regulatory element identification&lt;/li&gt;
&lt;li&gt;Genome stability studies&lt;/li&gt;
&lt;li&gt;Comparative genomics of non-B DNA structures&lt;/li&gt;
&lt;li&gt;Disease-associated variant analysis&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="publications"&gt;Publications&lt;/h2&gt;
&lt;p&gt;Wang, G., Mouratidis, I., Provatas, K., et al. (2025). ZSeeker: an optimized algorithm for Z-DNA detection in genomic sequences. &lt;em&gt;Briefings in Bioinformatics, 26&lt;/em&gt;(3).&lt;/p&gt;</description></item><item><title>kmerDB</title><link>https://ioannis-mouratidis.github.io/projects/kmerdb/</link><pubDate>Mon, 01 Jan 2024 00:00:00 +0000</pubDate><guid>https://ioannis-mouratidis.github.io/projects/kmerdb/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;kmerDB is a comprehensive database that consolidates genomic and proteomic k-mer sequence information across all species in Genbank and UniProt. This resource enables rapid species identification, comparative genomic studies, and evolutionary analysis.&lt;/p&gt;
&lt;h2 id="features"&gt;Features&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Comprehensive Coverage&lt;/strong&gt;: Encompasses k-mer data from all species in major sequence databases&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Dual Coverage&lt;/strong&gt;: Includes both genomic (DNA) and proteomic (amino acid) sequences&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Fast Queries&lt;/strong&gt;: Optimized data structures enable rapid k-mer lookups&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Species Identification&lt;/strong&gt;: Enables efficient molecular diagnostics and species authentication&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;100-fold Compression&lt;/strong&gt;: Novel compression procedures reduce data storage requirements dramatically&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="technical-implementation"&gt;Technical Implementation&lt;/h2&gt;
&lt;p&gt;The database was built using advanced compression algorithms achieving 100-fold data reduction while maintaining query performance. This enables storage and analysis of k-mer information from the entire tree of life.&lt;/p&gt;
&lt;h2 id="applications"&gt;Applications&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Species identification and authentication&lt;/li&gt;
&lt;li&gt;Comparative genomics&lt;/li&gt;
&lt;li&gt;Evolutionary studies&lt;/li&gt;
&lt;li&gt;Molecular diagnostics&lt;/li&gt;
&lt;li&gt;Environmental monitoring&lt;/li&gt;
&lt;li&gt;Food authentication&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="publications"&gt;Publications&lt;/h2&gt;
&lt;p&gt;Mouratidis, I., Baltoumas, F. A., Chantzi, N., et al. (2024). kmerDB: A database encompassing the set of genomic and proteomic sequence information for each species. &lt;em&gt;Computational and Structural Biotechnology Journal, 23&lt;/em&gt;.&lt;/p&gt;</description></item><item><title>Neomer Diagnostics</title><link>https://ioannis-mouratidis.github.io/projects/neomer-diagnostics/</link><pubDate>Sat, 01 Jan 2022 00:00:00 +0000</pubDate><guid>https://ioannis-mouratidis.github.io/projects/neomer-diagnostics/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;Co-founded Neomer Diagnostics in 2022 as Chief Technical Officer to translate patented nullomer research into a clinical cancer detection platform. The company developed machine learning pipelines for detecting cancer from liquid biopsies.&lt;/p&gt;
&lt;h2 id="role--achievements"&gt;Role &amp;amp; Achievements&lt;/h2&gt;
&lt;p&gt;As CTO, I:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Developed ML pipeline in Bash, Julia, Python, and Slurm for cancer detection from liquid biopsies&lt;/li&gt;
&lt;li&gt;Achieved AUC ranging from 0.89 to 0.94 in lung and ovarian cancers&lt;/li&gt;
&lt;li&gt;Established regulatory roadmap for clinical validation and FDA approval&lt;/li&gt;
&lt;li&gt;Secured $850K in translational research funding&lt;/li&gt;
&lt;li&gt;Led technical team and coordinated with clinical partners&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="technology"&gt;Technology&lt;/h2&gt;
&lt;p&gt;The platform leveraged sequences absent from the human genome (nullomers) as biomarkers for cancer detection. Machine learning models were trained on cell-free DNA and RNA data from liquid biopsies to distinguish cancer patients from healthy controls.&lt;/p&gt;
&lt;h2 id="clinical-applications"&gt;Clinical Applications&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Early cancer detection&lt;/li&gt;
&lt;li&gt;Cancer screening in high-risk populations&lt;/li&gt;
&lt;li&gt;Monitoring treatment response&lt;/li&gt;
&lt;li&gt;Detecting minimal residual disease&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="funding--recognition"&gt;Funding &amp;amp; Recognition&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Secured $850K in translational research funding&lt;/li&gt;
&lt;li&gt;Patent portfolio covering nullomer-based diagnostics&lt;/li&gt;
&lt;li&gt;Partnerships with clinical institutions&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="period"&gt;Period&lt;/h2&gt;
&lt;p&gt;January 2022 - May 2023&lt;/p&gt;</description></item></channel></rss>