Diogo Pratas was born in Aveiro. He graduated in Information and Communication Technologies from the University of Aveiro in 2008. During his degree, he participated in the Erasmus program in Computer Engineering at the Pontifical University of Salamanca, Spain. From 2008 to 2010, he worked in the private sector in networks, IT security, and the development of Linux systems. In 2016, he obtained his PhD in Informatics from the University of Aveiro with a dissertation on the compression and analysis of genomic sequences. He carried out postdoctoral research in Computer Science at the University of Aveiro from 2016 to 2019. In 2019, he was a Staff Bioinformatician at the University of Helsinki, Finland. Since the end of 2019, he has been an auxiliary researcher at the University of Aveiro in the areas of Informatics, Bioinformatics, and Artificial Intelligence. Since 2022, he has been a visiting researcher at the Department of Virology at the University of Helsinki, Finland. Since 2019, he has taught Algorithmic Information Theory at the Department of Electronics, Telecommunications, and Informatics at the University of Aveiro. He has organized several scientific conferences, workshops, and competitions, including the International Conference on Algorithms for Computational Biology, the Iberian Conference on Pattern Recognition, the Workshop on Genomics for Physicians, and the Portuguese League of Bioinformatics. He actively participates in scientific associations, including the European Society for Clinical Virology (ESCV) and the Portuguese Association for Pattern Recognition (APRP), having served as secretary of the APRP (2018-2020). His main areas of interest and research are Bioinformatics, Computational Biology, and Information Theory. He has carried out extensive research on automatic pattern recognition to analyze and minimize the content of biological information, focusing in particular on Computational Virology.
He has participated as a speaker at various international conferences and scientific meetings and is the author of several publications and articles in the areas of Informatics, Medicine, and Biology.
Identification

Personal identification

Full name
Diogo Pratas

Citation names

  • Pratas, Diogo

Author identifiers

Ciência ID
5A1C-F0F0-7E62
ORCID iD
0000-0003-1176-552X
Google Scholar ID
HasPwO0AAAAJ
Scopus Author Id
49361962700

Addresses

  • IEETA, Campus Universitário de Santiago, 3810-193, Aveiro, Aveiro, Portugal (Professional)

Websites

  • pratas.github.io (Scholar)

Knowledge fields

  • Exact Sciences - Computer and Information Sciences - Bioinformatics

Languages

Language | Speaking | Reading | Writing | Listening | Peer-review
Portuguese (Mother tongue)
English | Advanced (C1) | Advanced (C1) | Advanced (C1) | Advanced (C1) | Advanced (C1)
Spanish; Castilian | Upper intermediate (B2) | Upper intermediate (B2) | Upper intermediate (B2) | Upper intermediate (B2) | Upper intermediate (B2)
French | Intermediate (B1) | Intermediate (B1) | Intermediate (B1) | Intermediate (B1) | Intermediate (B1)
Education
Degree Classification
2011/09/01 - 2016/01/19
Concluded
Informatics (PhD)
Major in Bioinformatics
Universidade de Aveiro, Portugal
"Compression and analysis of genomic data" (THESIS/DISSERTATION)
2004 - 2008
Concluded
Information and Communication Technologies (Licenciatura)
Universidade de Aveiro, Portugal
Affiliation

Science

Category
Host institution
Employer
2022/06/01 - Current Visiting Researcher (Research) Helsingin Yliopisto, Finland
2019/08 - Current Auxiliary Researcher (Research) Universidade de Aveiro, Portugal
Universidade de Aveiro, Portugal
2019/03/17 - 2019/07/17 Contracted Researcher (Research) Helsingin Yliopisto, Finland
Helsingin yliopisto Haartman-instituutti, Finland

Other Careers

Category
Host institution
Employer
2009 - 2010 IT Consultant (Specific Categories and Functions) IPortalMais, Portugal
2008 - 2008 Trainee IT Technician (IT Technician) Dimension Data UK, United Kingdom

Others

Category
Host institution
Employer
2016/06/01 - 2019/03 Researcher of the project: “The normalized relative compression distance”. Universidade de Aveiro Instituto de Engenharia Eletrónica e Informática de Aveiro, Portugal
2015 - 2018 Researcher Universidade de Aveiro Instituto de Engenharia Eletrónica e Informática de Aveiro, Portugal
2012/11/01 - 2016/03/31 Researcher of the project: "RD-CONNECT – An integrated platform connecting registries, biobanks and clinical bioinformatics for rare disease research" Universidade de Aveiro Instituto de Engenharia Eletrónica e Informática de Aveiro, Portugal
2012/10/01 - 2013/12/31 IEETA Integrated member. Subject: Compression and analysis of genomic data Universidade de Aveiro Instituto de Engenharia Eletrónica e Informática de Aveiro, Portugal
2011 - 2013 Researcher Universidade de Aveiro Instituto de Engenharia Eletrónica e Informática de Aveiro, Portugal
2010/07/01 - 2012/09/30 Researcher of the project: "Analysis of DNA sequences through compression based complexity profiles" Universidade de Aveiro Instituto de Engenharia Eletrónica e Informática de Aveiro, Portugal
2009/12/01 - 2010/06/30 Researcher of the project: "Finite-context models for DNA" Universidade de Aveiro Instituto de Engenharia Eletrónica e Informática de Aveiro, Portugal
Projects

Contract

Designation Funders
2024 - 2028 The Human Tissue Virome - Comprehensive Impact Analysis #5
Not applicable
Researcher
Jane ja Aatos Erkon Säätiö
Ongoing
2023 - 2027 Time, Place, and DNA - Ancient host-pathogen genomics in Finland
Not applicable
Researcher
Helsingin Yliopisto, Finland
Suomen Kulttuurirahasto
Ongoing
2021 - 2025 Molecular genetic time travel and ancient diseases
Not applicable
Researcher
Helsingin Yliopisto, Finland
Suomalainen Tiedeakatemia
Ongoing
2023 - 2024 The Human Tissue Virome - Comprehensive Impact Analysis #3
Not applicable
Researcher
Helsingin Yliopisto, Finland
Finska Läkaresällskapet
Ongoing
2023 - 2024 The Human Tissue Virome - Comprehensive Impact Analysis #4
Not applicable
Researcher
Helsingin Yliopisto, Finland
Suomen Lääketieteen Säätiö
Ongoing
2021 - 2024 Levänluhta and Käldamäki water burials – will molecules and isotopes solve an Iron Age mystery?
Not applicable
Researcher
Helsingin Yliopisto, Finland
Koneen Säätiö
Ongoing
2020 - 2024 The Human Tissue Virome - Comprehensive Impact Analysis #2
Not applicable
Researcher
Helsingin Yliopisto, Finland
Medicinska Understödsföreningen Liv och Hälsa rf
Ongoing
2020 - 2024 Levänluhta and Käldamäki water burials – will molecules and isotopes solve an Iron Age mystery?
Not applicable
Researcher
Kuvataideakatemia
2019 - 2024 Intelligent Reconstruction and Analysis of Ancient Genomes
CEECINST/00026/2018
Principal investigator
Universidade de Aveiro, Portugal
Fundação para a Ciência e a Tecnologia
Ongoing
2022 - 2023 Ancient Virus Infections in the Chachapoya Population of South American Mountain Forests
Not applicable
Researcher
2021 - 2022 Understanding ancient pathogen genomics – understanding future pandemics
Not applicable
Researcher
Societas Scientiarum Fennica
2021 - 2022 Understanding ancient pathogen genomics – understanding future pandemics
Not applicable
Researcher
2019 - 2021 The Human Tissue Virome - Comprehensive Impact Analysis #1
Not applicable
Researcher
Helsingin Yliopisto, Finland
Suomen Lääketieteen Säätiö
Concluded
2016 - 2019 The normalized relative compression distance
PTDC/EEI-SII/6608/2014
Post-doc Fellow
Universidade de Aveiro, Portugal
Fundação para a Ciência e a Tecnologia
Concluded
2013 - 2016 RD-CONNECT - An integrated platform connecting registries, biobanks and clinical bioinformatics for rare disease research
305444
PhD Student Fellow
European Commission
Concluded
2010 - 2013 Analysis of DNA sequences through compression-based complexity profiles
PTDC/EIA-EIA/103099/2008
Research Fellow
Universidade de Aveiro, Portugal
Fundação para a Ciência e a Tecnologia
Concluded
2009 - 2010 Finite context models for DNA
PTDC/EIA/72569/2006
Research Fellow
Universidade de Aveiro, Portugal
Fundação para a Ciência e a Tecnologia
Concluded
Outputs

Publications

Book
  1. Figueiredo, D.; Martín-Vide, C.; Pratas, D.; Vega-Rodríguez, M.A.. Algorithms for Computational Biology. Springer. 2017.
    Published
Book chapter
  1. Pinho, A.J.; Pratas, D.; Garcia, S.P.. "Compressing resequencing data with GReEn". In Deep Sequencing Data Analysis. 2013.
    10.1007/978-1-62703-514-9_2
Conference paper
  1. Sousa, Maria J. P.; Pratas, Diogo. "A method for accurate reconstruction of persistent human viral sequences". Paper presented in Portuguese Conference on Pattern Recognition, Coimbra, 2023.
    Published
  2. Jorge Miguel Silva; Diogo Pratas; Sérgio Matos. "Exploring Kolmogorov Complexity Approximations for Data Analysis: Insights and Applications". Paper presented in Doctoral Conference on Computing, Electrical and Industrial Systems, 2023.
    Published • 10.1007/978-3-031-36007-7_12
  3. Pratas, Diogo; Pinho, Armando J.. "JARVIS2: a data compressor for large genome sequences". Paper presented in Data Compression Conference, 2023.
    10.1109/dcc55655.2023.00037
  4. J. M. Silva; D. Pratas; T. Caetano; S. Matos. "Feature-Based Classification of Archaeal Sequences Using Compression-Based Methods". Paper presented in Pattern Recognition and Image Analysis, IbPRIA 2022, 2022.
    Published • 10.1007/978-3-031-04881-4_25
  5. Sousa, Maria J. P.; Pratas, Diogo. "A survey on computational tools for human viral genomes reconstruction". Paper presented in Portuguese Conference on Pattern Recognition, Leiria, 2022.
    Published
  6. Sousa, Maria J. P.; Rita Ferrolho; Tiago Fonseca; Armando J. Pinho; Pratas, Diogo. "Improving the compression of a complete Telomere-to-Telomere (T2T) human genome sequence". Paper presented in Portuguese Conference on Pattern Recognition, Leiria, 2022.
    Published
  7. Jorge Miguel Ferreira da Silva; Pratas, Diogo; Caetano, Tania; Matos, Sérgio. "Archaea Taxonomic Classification". Paper presented in 27th Portuguese Conference on Pattern Recognition, RecPad 2021, Évora, 2021.
    Published
  8. Jorge Miguel Ferreira da Silva; Pratas, Diogo; Matos, Sérgio. "Comparison and Evaluation of Information-based Measures in Images.". Paper presented in 26th Portuguese Conference on Pattern Recognition, RecPad 2020, Évora, 2020.
    Published
  9. Pratas, D.; Hosseini, M.; Pinho, A.J.. "Visualization of Similar Primer and Adapter Sequences in Assembled Archaeal Genomes". Paper presented in International Conference on Practical Applications of Computational Biology & Bioinformatics, 2019.
    10.1007/978-3-030-23873-5_16
  10. Pratas, D.; Hosseini, M.; Pinho, A.J.. "GeCo2: An Optimized Tool for Lossless Compression and Analysis of DNA Sequences". Paper presented in International Conference on Practical Applications of Computational Biology & Bioinformatics, 2019.
    10.1007/978-3-030-23873-5_17
  11. Hosseini, M.; Pratas, D.; Pinho, A.J.. "A probabilistic method to find and visualize distinct regions in protein sequences". 2019.
    10.23919/EUSIPCO.2019.8902695
  12. Pratas, D.; Pinho, A.J.. "A DNA sequence corpus for compression benchmark". Paper presented in International Conference on Practical Applications of Computational Biology & Bioinformatics, 2019.
    10.1007/978-3-319-98702-6_25
  13. Hosseini, Morteza; Pratas, Diogo; Armando J. Pinho. "Clustering DNA sequences by relative compression". Paper presented in 25th Portuguese Conference on Pattern Recognition, RecPad 2019, Porto, 2019.
    Published
  14. Jorge Miguel Ferreira da Silva; Pratas, Diogo; Matos, Sérgio. "Evaluation of Statistical Complexity in Viral Genome Sequences". Paper presented in 25th Portuguese Conference on Pattern Recognition, RecPad 2019, Porto, 2019.
    Published
  15. Pratas, D.; Hosseini, M.; Pinho, A.J.. "Compression of amino acid sequences". Paper presented in International Conference on Practical Applications of Computational Biology & Bioinformatics, 2018.
    10.1007/978-3-319-98702-6_13
  16. Gaspar, M.; Pratas, D.; Pinho, A.J.. "NET-ASAR: A tool for DNA sequence search based on data compression". Paper presented in NET-ASAR: A tool for DNA sequence search based on data compression, 2018.
    10.1007/978-3-319-98702-6_14
  17. Pratas, D.; Pinho, A.J.. "Metagenomic composition analysis of sedimentary ancient DNA from the Isle of Wight". 2018.
    10.23919/EUSIPCO.2018.8553297
  18. Pinho, A.J.; Pratas, D.. "An Application of Data Compression Models to Handwritten Digit Classification". Paper presented in International Conference on Advanced Concepts for Intelligent Vision Systems, 2018.
    10.1007/978-3-030-01449-0_41
  19. Ana Teixeia; Pratas, Diogo; Armando J. Pinho; Raquel M. Silva. "Evolutionary insights from the comparative analysis of hominid genomes". Paper presented in 24th Portuguese Conference on Pattern Recognition, RecPad 2018, Coimbra, 2018.
    Published
  20. Catarina Figueiredo; Pratas, Diogo; Armando J. Pinho; Raquel M. Silva. "Identification of antifungal targets using alignment-free methods". Paper presented in 24th Portuguese Conference on Pattern Recognition, RecPad 2018, Coimbra, 2018.
    Published
  21. Pratas, D.; Hosseini, M.; Pinho, A.J.. "Substitutional tolerant Markov models for relative compression of DNA sequences". Paper presented in International Conference on Practical Applications of Computational Biology, 2017.
    10.1007/978-3-319-60816-7_32
  22. Pratas, D.; Hosseini, M.; Pinho, A.J.. "Cryfa: A tool to compact and encrypt FASTA files". Paper presented in 11th International Conference on Practical Applications of Computational Biology & Bioinformatics, 2017.
    10.1007/978-3-319-60816-7_37
  23. Pratas, D.; Pinho, A.J.. "On the approximation of the Kolmogorov complexity for DNA sequences". Paper presented in Iberian Conference on Pattern Recognition and Image Analysis, 2017.
    10.1007/978-3-319-58838-4_29
  24. Hosseini, M.; Pratas, D.; Pinho, A.J.. "On the role of inverted repeats in DNA sequence similarity". Paper presented in International Conference on Practical Applications of Computational Biology & Bioinformatics, 2017.
    10.1007/978-3-319-60816-7_28
  25. Pratas, D.; Hosseini, M.; Silva, R.M.; Pinho, A.J.; Ferreira, P.J.S.G.. "Visualization of distinct DNA regions of the modern human relatively to a neanderthal genome". Paper presented in Iberian Conference on Pattern Recognition and Image Analysis, 2017.
    10.1007/978-3-319-58838-4_26
  26. Pratas, D.; Pinho, A.J.; Ferreira, P.J.S.G.. "Efficient Compression of Genomic Sequences". 2016.
    10.1109/DCC.2016.60
  27. Pinho, A.J.; Pratas, D.; Ferreira, P.J.S.G.. "Authorship Attribution Using Relative Compression". 2016.
    10.1109/DCC.2016.53
  28. Pratas, Diogo; Raquel M. Silva; Armando J. Pinho. "Detection and visualisation of regions of human DNA not present in other primates". Paper presented in 21st Portuguese Conference on Pattern Recognition, RecPad 2015, Faro, 2015.
    Published
  29. Pratas, D.; Pinho, A.J.. "A conditional compression distance that unveils insights of the genomic evolution". 2014.
    10.1109/DCC.2014.58
  30. Pratas, D.; Pinho, A.J.. "Exploring deep Markov models in genomic data compression using sequence pre-analysis". 2014.
  31. Pinho, A.J.; Pratas, D.; Ferreira, P.J.S.G.. "Information profiles for DNA pattern discovery". 2014.
    10.1109/DCC.2014.54
  32. Pinho, A.J.; Pratas, D.; Ferreira, P.J.S.G.. "A new compressor for measuring distances among images". Paper presented in International Conference Image Analysis and Recognition, 2014.
    10.1007/978-3-319-11758-4_4
  33. Pratas, Diogo; Raquel M. Silva; Armando J. Pinho. "Large-scale inversions between human reference assemblies". Paper presented in 20th Portuguese Conference on Pattern Recognition, RecPad 2014, Covilhã, 2014.
    Published
  34. Raquel M. Silva; Castro, Luísa; Pratas, Diogo; Armando J. Pinho. "Towards personalized medicine: ebola virus absent words in the human genome". Paper presented in 20th Portuguese Conference on Pattern Recognition, RecPad 2014, Covilhã, 2014.
    Published
  35. Pratas, Diogo; Armando J. Pinho. "Insights into primates genomic evolution using a compression distance". Paper presented in 19th Portuguese Conference on Pattern Recognition, RecPad 2013, Lisbon, 2013.
    Published
  36. Pratas, D.; Pinho, A.J.; Garcia, S.P.. "Computation of the normalized compression distance of DNA sequences using a mixture of finite-context models". 2012.
  37. Pratas, D.; Pinho, A.J.. "On the detection of unknown locally repeating patterns in images". Paper presented in International Conference Image Analysis and Recognition, 2012.
    10.1007/978-3-642-31295-3_19
  38. Pratas, D.; Pinho, A.J.; Garcia, S.P.. "Exon: A web-based software toolkit for DNA sequence analysis". Paper presented in 6th International Conference on Practical Applications of Computational, 2012.
    10.1007/978-3-642-28839-5_25
  39. Matos, L.M.O.; Pratas, D.; Pinho, A.J.. "Compression of whole genome alignments using a mixture of finite-context models". Paper presented in International Conference Image Analysis and Recognition, 2012.
    10.1007/978-3-642-31295-3_42
  40. Pratas, Diogo; Armando J. Pinho. "On the compression of FASTQ quality-scores". Paper presented in 18th Portuguese Conference on Pattern Recognition, RecPad 2012, Coimbra, 2012.
    Published
  41. Pratas, Diogo; Armando J. Pinho. "M6: a method for compressing complete genomes using Markov models". Paper presented in 7th Doctoral Symposium in Informatics Engineering, DSIE 2012, Porto, 2012.
    Published
  42. Pratas, D.; Bastos, C.A.C.; Pinho, A.J.; Neves, A.J.R.; Matos, L.M.O.. "DNA synthetic sequences generation using multiple competing Markov models". 2011.
    10.1109/SSP.2011.5967639
  43. Pinho, A.J.; Pratas, D.; Ferreira, P.J.S.G.. "Bacteria DNA sequence compression using a mixture of finite-context models". 2011.
    10.1109/SSP.2011.5967637
  44. Pinho, A.J.; Pratas, D.; Ferreira, P.J.S.G.; Garcia, S.P.. "Symbolic to numerical conversion of DNA sequences using finite-context models". 2011.
  45. Pratas, D.; Pinho, A.J.. "Compressing the human genome using exclusively Markov models". Paper presented in 5th International Conference on Practical Applications of Computational, 2011.
    10.1007/978-3-642-19914-1_29
  46. Pinho, A.J.; Pratas, D.; Garcia, S.P.. "Complexity profiles of DNA sequences using finite-context models". Paper presented in Symposium of the Austrian HCI and Usability Engineering Group, 2011.
    10.1007/978-3-642-25364-5_8
  47. Pratas, Diogo; Sara P. Garcia; Armando J. Pinho. "Analysis of patterns in S. pombe genome through compression-based complexity profiles". Paper presented in 17th Portuguese Conference on Pattern Recognition, RecPad 2011, Porto, 2011.
    Published
  48. Pratas, Diogo; Armando J. Pinho. "Analysis of DNA sequences using finite-context modelling and compression". Paper presented in 16th Portuguese Conf. on Pattern Recognition, RecPad 2010, Vila Real, 2010.
    Published
  49. Pratas, Diogo; Armando J. Pinho; Neves, Antonio J. R.; Carlos A. C. Bastos. "DNA synthetic sequences generated by finite-context models". Paper presented in 16th Portuguese Conf. on Pattern Recognition, RecPad 2010, Vila Real, 2010.
    Published
Conference poster
  1. 761B-0575-6338 ; 671E-AA3E-3770; Sousa, Sérgio F.; Pratas, Diogo; Carneiro, João. "Decoding the Genomic Diversity of Hepatitis E Virus in European Rabbits: A Step Towards Understanding Zoonotic Transmission". Paper presented in "The 20th Portugaliæ Genetica: DNA - Ancient and New" 21-22 March 2024, 2024.
  2. 761B-0575-6338 ; 671E-AA3E-3770; Sousa, Sérgio F.; Pratas, Diogo; Carneiro, João. "Evolutionary Insights into Plastic-Degrading Enzymes: A Data-Driven Approach to Bioremediation". Paper presented in "The 20th Portugaliæ Genetica: DNA - Ancient and New" 21-22 March 2024, 2024.
  3. Mariana Fernandes; Clara Cerqueira; Pratas, Diogo; Sousa, Sérgio F.; Carneiro, João. "Exploring the Evolution of PET-Degrading Enzymes: Insights from Sequence Alignment and Phylogenetic Analysis". Paper presented in "The 20th Portugaliæ Genetica: DNA - Ancient and New" 21-22 March 2024, 2024.
  4. Clara Cerqueira; E11D-C109-D995; Sousa, Sérgio F.; Pratas, Diogo; Carneiro, João. "Creating a Comprehensive Database of Plastic Degrading Enzymes for Machine Learning Applications". Paper presented in Bioinformatics Open Days XIII Edition 14-16 March 2024, 2024.
  5. Mariana Fernandes; Clara Cerqueira; Pratas, Diogo; Sousa, Sérgio F.; Carneiro, João. "Unifying Information on Plastic Degrading Enzymes Across Different Databases". Paper presented in Bioinformatics Open Days XIII Edition 14-16 March 2024, 2024.
Journal article
  1. Silva, Jorge M; Qi, Weihong; Pinho, Armando J; Pratas, Diogo. "AlcoR: alignment-free simulation, mapping, and visualization of low-complexity regions in biological data". GigaScience 12 (2023): http://dx.doi.org/10.1093/gigascience/giad101.
    10.1093/gigascience/giad101
  2. João Carneiro; Francisco Pascoal; Miguel Semedo; Diogo Pratas; Maria Paola Tomasino; Adriana Rego; Carvalho MF; Ana Paula Mucha; Catarina Magalhães. "Mapping human pathogens in wastewater using a metatranscriptomic approach". Environmental Research (2023): 116040-116040. http://dx.doi.org/10.1016/j.envres.2023.116040.
    10.1016/j.envres.2023.116040
  3. João Carneiro; Rita P. Magalhães; Victor M. de la Oliva Roque; Manuel Simões; D. Pratas; Sergio F. Sousa. "TargIDe: a machine-learning workflow for target identification of molecules with antibiofilm activity against Pseudomonas aeruginosa". Journal of Computer-Aided Molecular Design (2023): http://dx.doi.org/10.1007/s10822-023-00505-5.
    10.1007/s10822-023-00505-5
  4. Lari Pyöriä; D. Pratas; Mari Toppinen; Klaus Hedman; Antti Sajantila; Maria F. Perdomo. "Elimistömme on lukuisten terveyteemme vaikuttavien virusten koti". Duodecim 139 8 (2023): https://researchportal.helsinki.fi/en/publications/be81265b-11ca-4bdd-832d-11269fc86887.
  5. Lari Pyöriä; D. Pratas; Mari Toppinen; Klaus Hedman; Antti Sajantila; Maria F. Perdomo. "Unmasking the tissue-resident eukaryotic DNA virome in humans". Nucleic Acids Research (2023): http://dx.doi.org/10.1093/nar/gkad199.
    10.1093/nar/gkad199
  6. Maria Jauhiainen; Ushanandini Mohanraj; Martin Lehecka; Mika Niemelä; Timo P. Hirvonen; Diogo Pratas; Maria F. Perdomo; et al. "Herpesviruses, polyomaviruses, parvoviruses, papillomaviruses, and anelloviruses in vestibular schwannoma". Journal of NeuroVirology (2023): http://dx.doi.org/10.1007/s13365-023-01112-8.
    10.1007/s13365-023-01112-8
  7. Weihong Qi; Yi-Wen Lim; Andrea Patrignani; Pascal Schläpfer; Anna Bratus-Neuenschwander; Simon Grüter; Christelle Chanez; et al. Corresponding author: Wilhelm Gruissem. "The haplotype-resolved chromosome pairs of a heterozygous diploid African cassava cultivar reveal novel pan-genome and allele-specific transcriptome features". GigaScience 11 (2022): http://dx.doi.org/10.1093/gigascience/giac028.
    10.1093/gigascience/giac028
  8. Outi Ilona Mielonen; D. Pratas; Klaus Hedman; Antti Sajantila; Maria Fernanda Perdomo Cruz. "Detection of Low-Copy Human Virus DNA upon Prolonged Formalin Fixation". Viruses (2022): https://www.mdpi.com/1999-4915/14/1/133.
    10.3390/v14010133
  9. Silva, Jorge Miguel; Pratas, Diogo; Caetano, Tânia; Matos, Sérgio. "The complexity landscape of viral genomes". GigaScience 11 (2022): http://dx.doi.org/10.1093/gigascience/giac079.
    10.1093/gigascience/giac079
  10. J Monteiro; Pratas, Diogo; A Videira; F Pereira. Corresponding author: F Pereira. "Revisiting the Neurospora crassa mitochondrial genome". Letters in Applied Microbiology 73 4 (2021): 495-505. http://dx.doi.org/10.1111/lam.13538.
    10.1111/lam.13538
  11. Mari Toppinen; A Sajantila; Pratas, Diogo; K Hedman; MF Perdomo. Corresponding author: MF Perdomo. "The Human Bone Marrow Is Host to the DNAs of Several Viruses". Frontiers in Cellular and Infection Microbiology 11 (2021): http://dx.doi.org/10.3389/fcimb.2021.657245.
    10.3389/fcimb.2021.657245
  12. Milton Silva; D. Pratas; Armando J Pinho. "AC2: An Efficient Protein Sequence Compression Tool Using Artificial Neural Networks and Cache-Hash Models". Entropy (2021): https://www.mdpi.com/1099-4300/23/5/530.
    10.3390/e23050530
  13. Silva, J.M.; Pratas, D.; Antunes, R.; Matos, S.; Pinho, A.J.. "Automatic analysis of artistic paintings using information-based measures". Pattern Recognition 114 (2021): http://www.scopus.com/inward/record.url?eid=2-s2.0-85100446723&partnerID=MN8TOARS.
    10.1016/j.patcog.2021.107864
  14. Almeida, J.R.; Pratas, D.; Oliveira, J.L.. "A semi-automatic methodology for analysing distributed and private biobanks". Computers in Biology and Medicine 130 (2021): http://www.scopus.com/inward/record.url?eid=2-s2.0-85098456035&partnerID=MN8TOARS.
    10.1016/j.compbiomed.2020.104180
  15. Pratas, Diogo. "Efficient DNA sequence compression with neural networks". GigaScience 9 11 (2020): http://dx.doi.org/10.1093/gigascience/giaa119.
    10.1093/gigascience/giaa119
  16. Pratas, Diogo. "The landscape of persistent human DNA viruses in femoral bone". Forensic Science International: Genetics (2020): http://dx.doi.org/10.1016/j.fsigen.2020.102353.
    10.1016/j.fsigen.2020.102353
  17. Pratas, Diogo. "A hybrid pipeline for reconstruction and analysis of viral genomes at multi-organ level". GigaScience (2020): http://dx.doi.org/10.1093/gigascience/giaa086.
    10.1093/gigascience/giaa086
  18. Pratas, Diogo. "Persistent minimal sequences of SARS-CoV-2". Bioinformatics (2020): http://dx.doi.org/10.1093/bioinformatics/btaa686.
    10.1093/bioinformatics/btaa686
  19. Pratas, Diogo. "GTO: A toolkit to unify pipelines in genomic and proteomic research". SoftwareX (2020): http://dx.doi.org/10.1016/j.softx.2020.100535.
    10.1016/j.softx.2020.100535
  20. Pratas, Diogo. "Smash++: an alignment-free and memory-efficient tool to find genomic rearrangements". GigaScience (2020): http://dx.doi.org/10.1093/gigascience/giaa048.
    10.1093/gigascience/giaa048
  21. Jorge Miguel Silva; Eduardo Pinho; Sérgio Matos; D. Pratas. "Statistical Complexity Analysis of Turing Machine tapes with Fixed Algorithmic Complexity Using the Best-Order Markov Model". Entropy (2020): https://www.mdpi.com/1099-4300/22/1/105.
    10.3390/e22010105
  22. D. Pratas; Morteza Hosseini; Jorge Miguel Silva; Armando J Pinho. "A Reference-Free Lossless Compression Algorithm for DNA Sequences Using a Competitive Prediction of Two Classes of Weighted Models". Entropy (2019): https://www.mdpi.com/1099-4300/21/11/1074.
    10.3390/e21111074
  23. Hosseini, M.; Pratas, D.; Pinho, A.J.. "AC: A Compression Tool for Amino Acid Sequences". Interdisciplinary Sciences: Computational Life Sciences 11 1 (2019): 68-76. http://www.scopus.com/inward/record.url?eid=2-s2.0-85062690507&partnerID=MN8TOARS.
    10.1007/s12539-019-00322-1
  24. Hosseini, M.; Pratas, D.; Pinho, A.J.. "Cryfa: A secure encryption tool for genomic data". Bioinformatics 35 1 (2019): 146-148. http://www.scopus.com/inward/record.url?eid=2-s2.0-85058744156&partnerID=MN8TOARS.
    10.1093/bioinformatics/bty645
  25. D. Pratas; Morteza Hosseini; Gonçalo Grilo; Armando J Pinho; Raquel M Silva; Caetano T; João Carneiro; Filipe Pereira. "Metagenomic Composition Analysis of an Ancient Sequenced Polar Bear Jawbone from Svalbard". Genes (2018): http://www.mdpi.com/2073-4425/9/9/445.
    10.3390/genes9090445
  26. Pratas, Diogo. "Comparison of Compression-Based Measures with Application to the Evolution of Primate Genomes". Entropy (2018): http://www.mdpi.com/1099-4300/20/6/393.
    10.3390/e20060393
  27. Carvalho, João M.; Brás, Susana; Pratas, Diogo; Ferreira, Jacqueline; Soares, Sandra C.; Pinho, Armando J.. "Extended-alphabet finite-context models". (2018): http://hdl.handle.net/10773/27612.
    10.1016/j.patrec.2018.05.026
  28. Hosseini, M.; Pratas, D.; Pinho, A.J.. "A survey on data compression methods for biological sequences". Information (Switzerland) 7 4 (2016): http://www.scopus.com/inward/record.url?eid=2-s2.0-85007393441&partnerID=MN8TOARS.
    10.3390/info7040056
  29. Pratas, D.; Silva, R.M.; Pinho, A.J.; Ferreira, P.J.S.G.. "An alignment-free method to find and visualise rearrangements between pairs of DNA sequences". Scientific Reports 5 (2015): http://www.scopus.com/inward/record.url?eid=2-s2.0-84929429321&partnerID=MN8TOARS.
    10.1038/srep10203
  30. Matos, L.M.O.; Neves, A.J.R.; Pratas, D.; Pinho, A.J.. "Mafco: A compression tool for MAF files". PLoS ONE 10 3 (2015): http://www.scopus.com/inward/record.url?eid=2-s2.0-84929484087&partnerID=MN8TOARS.
    10.1371/journal.pone.0116082
  31. Silva, R.M.; Pratas, D.; Castro, L.; Pinho, A.J.; Ferreira, P.J.S.G.. "Three minimal sequences found in Ebola virus genomes and absent from human DNA". Bioinformatics 31 15 (2015): 2421-2425. http://www.scopus.com/inward/record.url?eid=2-s2.0-84943639957&partnerID=MN8TOARS.
    10.1093/bioinformatics/btv189
  32. Pratas, D.; Pinho, A.J.; Rodrigues, J.M.. "XS: a FASTQ read simulator.". BMC research notes 7 (2014): http://www.scopus.com/inward/record.url?eid=2-s2.0-84908135542&partnerID=MN8TOARS.
    10.1186/1756-0500-7-40
  33. Pinho, A.J.; Pratas, D.. "Mfcompress: A compression tool for fasta and multi-fasta data". Bioinformatics 30 1 (2014): 117-118. http://www.scopus.com/inward/record.url?eid=2-s2.0-84891355058&partnerID=MN8TOARS.
    10.1093/bioinformatics/btt594
  34. De Matos, L.M.O.; Pratas, D.; Pinho, A.J.. "A compression model for DNA multiple sequence alignment blocks". IEEE Transactions on Information Theory 59 5 (2013): 3189-3198. http://www.scopus.com/inward/record.url?eid=2-s2.0-84876759103&partnerID=MN8TOARS.
    10.1109/TIT.2012.2236605
  35. Garcia, S.P.; Rodrigues, J.M.O.S.; Santos, S.; Pratas, D.; Afreixo, V.; Bastos, C.A.C.; Ferreira, P.J.S.G.; Pinho, A.J.. "A genomic distance for assembly comparison based on compressed maximal exact matches". IEEE/ACM Transactions on Computational Biology and Bioinformatics 10 3 (2013): 793-798. http://www.scopus.com/inward/record.url?eid=2-s2.0-84887940267&partnerID=MN8TOARS.
    10.1109/TCBB.2013.77
  36. Pinho, A.J.; Garcia, S.P.; Pratas, D.; Ferreira, P.J.S.G.. "DNA sequences at a glance". PLoS ONE 8 11 (2013): http://www.scopus.com/inward/record.url?eid=2-s2.0-84896690677&partnerID=MN8TOARS.
    10.1371/journal.pone.0079922
  37. Pinho, A.J.; Pratas, D.; Garcia, S.P.. "GReEn: A tool for efficient compression of genome resequencing data". Nucleic Acids Research 40 4 (2012): http://www.scopus.com/inward/record.url?eid=2-s2.0-84857860662&partnerID=MN8TOARS.
    10.1093/nar/gkr1124
Preprint
  1. Jorge Miguel Silva; Weihong Qi; Armando J Pinho; D. Pratas. "AlcoR: alignment-free simulation, mapping, and visualization of low-complexity regions in biological data". 2022. http://dx.doi.org/10.1101/2023.04.17.537157.
    10.1101/2023.04.17.537157
  2. Diogo Pratas; Armando J. Pinho; Raquel M. Silva; João M. O. S. Rodrigues; Morteza Hosseini; Tânia Caetano; Paulo J. S. G. Ferreira. "FALCON-meta: a method to infer metagenomic composition of ancient DNA". 2018. https://doi.org/10.1101/267179.
    10.1101/267179
Thesis / Dissertation
  1. Pratas, Diogo. "Compression and analysis of genomic data". PhD, 2016. http://hdl.handle.net/10773/16286.

Other

Software
  1. Jorge Miguel Silva; Pratas, Diogo. "canvas: Complexity Analysis Viral Sequences". 1.0. Universidade de Aveiro Instituto de Engenharia Electrónica e Telemática de Aveiro. https://github.com/jorgeMFS/canvas. 2022.
Activities

Supervision

Thesis Title
Role
Degree Subject (Type)
Institution / Organization
2023 - Current Intelligent reconstruction and analysis of viral genome sequences
Supervisor
Universidade de Aveiro, Portugal
2023 - Current Study of the impact of data compression on reducing energy consumption
Supervisor
Universidade de Aveiro, Portugal
2023 - Current Age estimation of ancient DNA samples in archaeology
Supervisor
Universidade de Aveiro, Portugal
2022 - Current Parameter optimization for improving data compression of DNA sequences
Supervisor
Universidade de Aveiro, Portugal
2023 - 2024 Machine Learning-Enhanced Optimization of Plastic-Degrading Enzymes for Sustainable Ocean Cleanup
Supervisor
Universidade do Porto, Portugal
2023 - 2024 Designing Optimal 3D Enzyme Computational Models for Efficient Plastic Degradation
Co-supervisor
Universidade do Porto, Portugal
2023 - 2024 Designing In-Silico Aptamers for Potential Use in Marine Bioremediation
Co-supervisor
Universidade do Porto, Portugal
2023 - 2024 Genomic Diversity and Zoonotic Potential of Hepatitis E Virus in European Rabbits: Implications for Diagnostic and Therapeutic Approaches
Supervisor
Universidade do Porto, Portugal
2022 - 2023 Impact of sorting in DNA sequence compression
Supervisor
Universidade de Aveiro, Portugal
2022 - 2023 Improving a Database of Cyanobacterial Bioactive Compounds that can be used for Therapeutic Approaches in Human Diseases
Co-supervisor
Universidade do Porto, Portugal
2022 - 2023 Automatic reconstruction of persistent human virus genome
Supervisor
Universidade de Aveiro, Portugal
2019 - 2023 Algorithmic Information Approximations in Data Analysis
Co-supervisor
Universidade de Aveiro, Portugal
2020 - 2021 Reconstruction and classification of unknown DNA sequences
Supervisor
Universidade de Aveiro, Portugal
2020 - 2021 Efficient biosequence compression using neural network
Supervisor
Universidade de Aveiro, Portugal
2016 - 2020 Compression models and tools for omics data
Co-supervisor
Universidade de Aveiro, Portugal
2016 - 2017 Automatic system for approximate and noncontiguous DNA sequences search
Co-supervisor
Universidade de Aveiro, Portugal

Event organisation

Event name
Type of event (Role)
Institution / Organization
2022 - Current Liga Portuguesa de Bioinformática (https://lpb.pt) (2022)
Other (Co-organiser)
2021 - 2022 Iberian Conference on Pattern Recognition and Image Analysis (2022/05/04 - 2022/05/06)
Conference (Member of the Organising Committee)
Universidade de Aveiro, Portugal
2019/03 - 2019/06 Workshop on Genomics for Physicians (2019/04 - 2019/06)
Workshop (Co-organiser)
Helsingin Yliopisto, Finland
2017 - 2017 International Conference on Algorithms for Computational Biology (2017/06/05 - 2017/06/06)
Conference (Co-organiser)
Universidade de Aveiro, Portugal
2016 - 2016 Portuguese Conference on Pattern Recognition (2016/10/28 - 2016/10/28)
Conference (Member of the Organising Committee)
Universidade de Aveiro, Portugal

Jury of academic degree

Topic
Role
Candidate name (Type of degree)
Institution / Organization
2023 Molecular evolution of DNA topoisomerases in animals
(Thesis) Arguer
Filipa Moreira (PhD)
Universidade do Porto Instituto de Ciências Biomédicas Abel Salazar, Portugal
2022 Development of DNA sequence classifiers based on deep learning
(Thesis) Main arguer
João Abreu (Master)
Universidade do Minho, Portugal

Association member

Society Organization name Role
2019 - Current European Society for Clinical Virology Member
2015 - Current Portuguese Association for Pattern Recognition Member (former Secretary)
2007 - Current Super Dimension Fortress
2019 - 2020 International Society for Computational Biology Member

Course / Discipline taught

Academic session Degree Subject (Type) Institution / Organization
2019 - Current Algorithmic Information Theory (Master's) Universidade de Aveiro, Portugal
2013 - 2014 Programming I (Bachelor's) Universidade de Aveiro, Portugal

Evaluation committee

Activity description
Role
Institution / Organization Funding entity
2023/06/15 - 2023/06/16 International Review Panel – Future Digital Challenge Concept Phase Review – Green Transition and Digital Transformation
Evaluator
Science Foundation Ireland, Ireland Science Foundation Ireland
Distinctions

Other distinction

2018 Award of scientific excellence, Toledo, Spain
2018 Best oral communication: "Metagenomic composition analysis of ancient DNA samples", 18th Portugaliæ Genetica Genetic Diversity in Structure and Regulation, 22 & 23 March 2018, I3S, Porto, Portugal
2015 Best paper at RECPAD 2015, Faro, Portugal
Universidade do Algarve, Portugal
2013 Honorable mention at Research Day 2013
Universidade de Aveiro, Portugal
2012 PAAMS'12 Award of scientific excellence, Salamanca, Spain