<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Genomics | Ioannis Mouratidis</title><link>https://ioannis-mouratidis.github.io/tags/Genomics/</link><atom:link href="https://ioannis-mouratidis.github.io/tags/Genomics/index.xml" rel="self" type="application/rss+xml"/><description>Genomics</description><generator>Hugo Blox Builder (https://hugoblox.com)</generator><language>en-us</language><lastBuildDate>Wed, 01 Jan 2025 00:00:00 +0000</lastBuildDate><image><url>https://ioannis-mouratidis.github.io/media/icon_hu_899445b689d8f445.png</url><title>Genomics</title><link>https://ioannis-mouratidis.github.io/tags/Genomics/</link></image><item><title>Genomic Data Compression Tool</title><link>https://ioannis-mouratidis.github.io/projects/compression-tool/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://ioannis-mouratidis.github.io/projects/compression-tool/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;A compression tool written in C++ and Python and optimized for multiple genomic file formats. It reduces storage requirements and compresses faster than existing solutions.&lt;/p&gt;
&lt;h2 id="performance"&gt;Performance&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;10-20% smaller file sizes&lt;/strong&gt; than standard genomic compression tools&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;50-70% faster compression&lt;/strong&gt;, enabling real-time analysis&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Multiple format support&lt;/strong&gt;: Handles various genomic data formats&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Lossless compression&lt;/strong&gt;: Maintains data integrity for scientific applications&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="technical-approach"&gt;Technical Approach&lt;/h2&gt;
&lt;p&gt;The tool exploits domain-specific knowledge about the structure of genomic data to improve both compression ratio and speed. The C++ implementation provides low-level performance optimization, while Python bindings allow easy integration into bioinformatics pipelines.&lt;/p&gt;
&lt;h2 id="impact"&gt;Impact&lt;/h2&gt;
&lt;p&gt;This compression tool enables:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Reduced storage costs for large-scale genomic projects&lt;/li&gt;
&lt;li&gt;Faster data transfer and backup operations&lt;/li&gt;
&lt;li&gt;Real-time compression for sequencing pipelines&lt;/li&gt;
&lt;li&gt;More efficient cloud-based genomic analysis&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="applications"&gt;Applications&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Large-scale sequencing projects&lt;/li&gt;
&lt;li&gt;Genomic data archiving&lt;/li&gt;
&lt;li&gt;Cloud-based bioinformatics platforms&lt;/li&gt;
&lt;li&gt;Real-time sequencing data processing&lt;/li&gt;
&lt;/ul&gt;</description></item><item><title>Leveraging sequences missing from the human genome to diagnose cancer</title><link>https://ioannis-mouratidis.github.io/publications/cancer-detection-2025/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://ioannis-mouratidis.github.io/publications/cancer-detection-2025/</guid><description>&lt;p&gt;This work demonstrates how sequences that are absent from the human genome can be leveraged as biomarkers for cancer detection, with potential applications in liquid biopsy-based diagnostics.&lt;/p&gt;</description></item><item><title>ZSeeker: an optimized algorithm for Z-DNA detection in genomic sequences</title><link>https://ioannis-mouratidis.github.io/publications/zseeker-2025/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://ioannis-mouratidis.github.io/publications/zseeker-2025/</guid><description>&lt;p&gt;ZSeeker provides researchers with a fast and accurate tool for identifying Z-DNA forming sequences across entire genomes, facilitating studies of genome regulation and stability.&lt;/p&gt;</description></item><item><title>kmerDB</title><link>https://ioannis-mouratidis.github.io/projects/kmerdb/</link><pubDate>Mon, 01 Jan 2024 00:00:00 +0000</pubDate><guid>https://ioannis-mouratidis.github.io/projects/kmerdb/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;kmerDB is a comprehensive database that consolidates genomic and proteomic k-mer sequence information across all species in GenBank and UniProt. This resource enables rapid species identification, comparative genomic studies, and evolutionary analysis.&lt;/p&gt;
&lt;h2 id="features"&gt;Features&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Comprehensive Coverage&lt;/strong&gt;: Encompasses k-mer data from all species in major sequence databases&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Dual Coverage&lt;/strong&gt;: Includes both genomic (DNA) and proteomic (amino acid) sequences&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Fast Queries&lt;/strong&gt;: Optimized data structures enable rapid k-mer lookups&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Species Identification&lt;/strong&gt;: Enables efficient molecular diagnostics and species authentication&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;100-fold Compression&lt;/strong&gt;: Novel compression procedures reduce data storage requirements dramatically&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="technical-implementation"&gt;Technical Implementation&lt;/h2&gt;
&lt;p&gt;The database was built with compression algorithms that achieve a 100-fold data reduction while maintaining query performance. This makes it practical to store and analyze k-mer information from the entire tree of life.&lt;/p&gt;
&lt;h2 id="applications"&gt;Applications&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Species identification and authentication&lt;/li&gt;
&lt;li&gt;Comparative genomics&lt;/li&gt;
&lt;li&gt;Evolutionary studies&lt;/li&gt;
&lt;li&gt;Molecular diagnostics&lt;/li&gt;
&lt;li&gt;Environmental monitoring&lt;/li&gt;
&lt;li&gt;Food authentication&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="publications"&gt;Publications&lt;/h2&gt;
&lt;p&gt;Mouratidis, I., Baltoumas, F. A., Chantzi, N., et al. (2024). kmerDB: A database encompassing the set of genomic and proteomic sequence information for each species. &lt;em&gt;Computational and Structural Biotechnology Journal, 23&lt;/em&gt;.&lt;/p&gt;</description></item><item><title>kmerDB: A database encompassing the set of genomic and proteomic sequence information for each species</title><link>https://ioannis-mouratidis.github.io/publications/kmerdb-2024/</link><pubDate>Mon, 01 Jan 2024 00:00:00 +0000</pubDate><guid>https://ioannis-mouratidis.github.io/publications/kmerdb-2024/</guid><description>&lt;p&gt;kmerDB provides the research community with a powerful tool for k-mer based species identification, comparative genomics, and evolutionary studies.&lt;/p&gt;</description></item></channel></rss>