<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.loc.gov/mods/v3 http://www.loc.gov/standards/mods/v3/mods-3-3.xsd">
  <mods>
    <titleInfo>
      <title>InpactorDB : a classified lineage-level plant LTR retrotransposon reference library for free-alignment methods based on machine learning</title>
    </titleInfo>
    <name type="personnal">
      <namePart type="family">Orozco-Arias</namePart>
      <namePart type="given">S.</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <name type="personnal">
      <namePart type="family">Jaimes</namePart>
      <namePart type="given">P. A.</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <name type="personnal">
      <namePart type="family">Candamil</namePart>
      <namePart type="given">M. S.</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <name type="personnal">
      <namePart type="family">Jimenez-Varon</namePart>
      <namePart type="given">C. F.</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <name type="personnal">
      <namePart type="family">Tabares-Soto</namePart>
      <namePart type="given">R.</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <name type="personnal">
      <namePart type="family">Isaza</namePart>
      <namePart type="given">G.</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <name type="personnal">
      <namePart type="family">Guyot</namePart>
      <namePart type="given">Romain</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <typeOfResource>text</typeOfResource>
    <genre authority="local">journalArticle</genre>
    <language>
      <languageTerm type="code" authority="iso639-2b">eng</languageTerm>
    </language>
    <physicalDescription>
      <internetMediaType>text/pdf</internetMediaType>
      <digitalOrigin>reformatted digital</digitalOrigin>
      <reformattingQuality>access</reformattingQuality>
    </physicalDescription>
    <abstract>Long terminal repeat (LTR) retrotransposons are mobile elements that constitute the major fraction of most plant genomes. The identification and annotation of these elements via bioinformatics approaches represent a major challenge in the era of massive plant genome sequencing. In addition to their involvement in genome size variation, LTR retrotransposons are also associated with the function and structure of different chromosomal regions and can alter the function of coding regions, among others. Several sequence databases of plant LTR retrotransposons are available for public access, such as PGSB and RepetDB, or restricted access such as Repbase. Although these databases are useful to identify LTR-RTs in new genomes by similarity, the elements of these databases are not fully classified to the lineage (also called family) level. Here, we present InpactorDB, a semi-curated dataset composed of 130,439 elements from 195 plant genomes (belonging to 108 plant species) classified to the lineage level. This dataset has been used to train two deep neural networks (i.e., one fully connected and one convolutional) for the rapid classification of these elements. In lineage-level classification approaches, we obtain up to 98% performance, indicated by the F1-score, precision and recall scores.</abstract>
    <targetAudience authority="marctarget">specialized</targetAudience>
    <subject>
      <topic>LTR retrotransposons</topic>
      <topic>machine learning</topic>
      <topic>deep neural networks</topic>
      <topic>bioinformatics</topic>
      <topic>plant genomes</topic>
      <topic>genomics</topic>
      <topic>InpactorDB</topic>
    </subject>
    <classification authority="local">020</classification>
    <classification authority="local">122</classification>
    <relatedItem type="host">
      <titleInfo>
        <title>Genes</title>
      </titleInfo>
      <part>
        <detail type="volume">
          <number>12</number>
        </detail>
        <detail type="volume">
          <number>2</number>
        </detail>
        <extent unit="pages">
          <list> 190 [17 p.]</list>
        </extent>
      </part>
      <originInfo>
        <dateIssued>2021</dateIssued>
      </originInfo>
    </relatedItem>
    <identifier type="uri">https://www.documentation.ird.fr/hor/fdi:010081058</identifier>
    <identifier type="doi">10.3390/genes12020190</identifier>
    <location>
      <shelfLocator>[F B010081058]</shelfLocator>
      <url usage="primary display" access="object in context">https://www.documentation.ird.fr/hor/fdi:010081058</url>
      <url access="row object">https://horizon.documentation.ird.fr/exl-doc/pleins_textes/divers21-03/010081058.pdf</url>
    </location>
    <recordInfo>
      <recordContentSource>IRD - Base Horizon / Pleins textes</recordContentSource>
      <recordCreationDate encoding="w3cdtf">2021-04-01</recordCreationDate>
      <recordChangeDate encoding="w3cdtf">2024-11-08</recordChangeDate>
      <recordIdentifier>fdi:010081058</recordIdentifier>
      <languageOfCataloging>
        <languageTerm authority="iso639-2b">fre</languageTerm>
      </languageOfCataloging>
    </recordInfo>
  </mods>
</modsCollection>
