Show simple item record

dc.contributor.authorPrifti, Edi
dc.contributor.authorChevaleyre, Yann
dc.contributor.authorHanczar, Blaise
dc.contributor.authorBelda, Eugeni
dc.contributor.authorDanchin, Antoine
dc.contributor.authorClément, Karine
dc.date.accessioned2020-06-03T14:20:48Z
dc.date.available2020-06-03T14:20:48Z
dc.date.issued2020
dc.identifier.urihttps://basepub.dauphine.fr/handle/123456789/20819
dc.language.isoenen
dc.subjectPredictionen
dc.subjectInterpretable modelsen
dc.subjectMetagenomics biomarkersen
dc.subjectMicrobial ecosystemsen
dc.subject.ddc005en
dc.titleInterpretable and accurate prediction models for metagenomics dataen
dc.typeArticle accepté pour publication ou publié
dc.description.abstractenBackground: Microbiome biomarker discovery for patient diagnosis, prognosis, and risk evaluation is attracting broad interest. Selected groups of microbial features provide signatures that characterize host disease states such as cancer or cardio-metabolic diseases. Yet, the current predictive models stemming from machine learning still behave as black boxes and seldom generalize well. Their interpretation is challenging for physicians and biologists, which makes them difficult to trust and use routinely in the physician–patient decision-making process. Novel methods that provide interpretability and biological insight are needed. Here, we introduce “predomics”, an original machine learning approach inspired by microbial ecosystem interactions that is tailored for metagenomics data. It discovers accurate predictive signatures and provides unprecedented interpretability. The decision provided by the predictive model is based on a simple, yet powerful score computed by adding, subtracting, or dividing cumulative abundance of microbiome measurements. Results: Tested on >100 datasets, we demonstrate that predomics models are simple and highly interpretable. Even with such simplicity, they are at least as accurate as state-of-the-art methods. The family of best models, discovered during the learning process, offers the ability to distil biological information and to decipher the predictability signatures of the studied condition. In a proof-of-concept experiment, we successfully predicted body corpulence and metabolic improvement after bariatric surgery using pre-surgery microbiome data. Conclusions: Predomics is a new algorithm that helps in providing reliable and trustworthy diagnostic decisions in the microbiome field. Predomics is in accord with societal and legal requirements that plead for an explainable artificial intelligence approach in the medical field.en
dc.relation.isversionofjnlnameGigaScience
dc.relation.isversionofjnlvol9en
dc.relation.isversionofjnlissue3en
dc.relation.isversionofjnldate2020-03
dc.relation.isversionofdoi10.1093/gigascience/giaa010en
dc.subject.ddclabelProgrammation, logiciels, organisation des donnéesen
dc.relation.forthcomingnonen
dc.relation.forthcomingprintnonen
dc.description.ssrncandidatenonen
dc.description.halcandidatenonen
dc.description.readershiprechercheen
dc.description.audienceInternationalen
dc.relation.Isversionofjnlpeerreviewednonen
dc.relation.Isversionofjnlpeerreviewednonen
dc.date.updated2020-06-03T14:09:00Z
hal.person.labIds541946
hal.person.labIds989
hal.person.labIds143247
hal.person.labIds542105
hal.person.labIds106187
hal.person.labIds1002956


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record