<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.loc.gov/mods/v3 http://www.loc.gov/standards/mods/v3/mods-3-3.xsd">
  <mods>
    <titleInfo>
      <title>K-mer-based machine learning method to classify LTR-retrotransposons in plant genomes</title>
    </titleInfo>
    <name type="personnal">
      <namePart type="family">Orozco-Arias</namePart>
      <namePart type="given">S.</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <name type="personnal">
      <namePart type="family">Candamil-Cortes</namePart>
      <namePart type="given">M. S.</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <name type="personnal">
      <namePart type="family">Jaimes</namePart>
      <namePart type="given">P. A.</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <name type="personnal">
      <namePart type="family">Pina</namePart>
      <namePart type="given">J. S.</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <name type="personnal">
      <namePart type="family">Tabares-Soto</namePart>
      <namePart type="given">R.</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <name type="personnal">
      <namePart type="family">Guyot</namePart>
      <namePart type="given">Romain</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <name type="personnal">
      <namePart type="family">Isaza</namePart>
      <namePart type="given">G.</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <typeOfResource>text</typeOfResource>
    <genre authority="local">journalArticle</genre>
    <language>
      <languageTerm type="code" authority="iso639-2b">eng</languageTerm>
    </language>
    <physicalDescription>
      <internetMediaType>text/pdf</internetMediaType>
      <digitalOrigin>reformatted digital</digitalOrigin>
      <reformattingQuality>access</reformattingQuality>
    </physicalDescription>
    <abstract>Every day more plant genomes are available in public databases and additional massive sequencing projects (i.e., that aim to sequence thousands of individuals) are formulated and released. Nevertheless, there are not enough automatic tools to analyze this large amount of genomic information. LTR retrotransposons are the most frequent repetitive sequences in plant genomes; however, their detection and classification are commonly performed using semi-automatic and time-consuming programs. Despite the availability of several bioinformatic tools that follow different approaches to detect and classify them, none of these tools can individually obtain accurate results. Here, we used Machine Learning algorithms based on k-mer counts to classify LTR retrotransposons from other genomic sequences and into lineages/families with an F1-Score of 95%, contributing to develop a free-alignment and automatic method to analyze these sequences.</abstract>
    <targetAudience authority="marctarget">specialized</targetAudience>
    <subject>
      <topic>Transposable elements</topic>
      <topic>LTR retrotransposons</topic>
      <topic>Plant genomes</topic>
      <topic>Machine learning</topic>
      <topic>Classification</topic>
      <topic>Free-alignment approach</topic>
      <topic>k-mer based method</topic>
    </subject>
    <classification authority="local">076</classification>
    <classification authority="local">020</classification>
    <classification authority="local">122</classification>
    <relatedItem type="host">
      <titleInfo>
        <title>PeerJ</title>
      </titleInfo>
      <part>
        <detail type="volume">
          <number>9</number>
        </detail>
        <extent unit="pages">
          <list> e11456 [20 p.]</list>
        </extent>
      </part>
      <originInfo>
        <dateIssued>2021</dateIssued>
      </originInfo>
      <identifier type="issn">2167-8359</identifier>
    </relatedItem>
    <identifier type="uri">https://www.documentation.ird.fr/hor/fdi:010081522</identifier>
    <identifier type="doi">10.7717/peerj.11456</identifier>
    <identifier type="issn">2167-8359</identifier>
    <location>
      <shelfLocator>[F B010081522]</shelfLocator>
      <url usage="primary display" access="object in context">https://www.documentation.ird.fr/hor/fdi:010081522</url>
      <url access="row object">https://horizon.documentation.ird.fr/exl-doc/pleins_textes/2021-07/010081522.pdf</url>
    </location>
    <recordInfo>
      <recordContentSource>IRD - Base Horizon / Pleins textes</recordContentSource>
      <recordCreationDate encoding="w3cdtf">2021-07-02</recordCreationDate>
      <recordChangeDate encoding="w3cdtf">2021-07-02</recordChangeDate>
      <recordIdentifier>fdi:010081522</recordIdentifier>
      <languageOfCataloging>
        <languageTerm authority="iso639-2b">fre</languageTerm>
      </languageOfCataloging>
    </recordInfo>
  </mods>
</modsCollection>
