<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.loc.gov/mods/v3 http://www.loc.gov/standards/mods/v3/mods-3-3.xsd">
  <mods>
    <titleInfo>
      <title>Data quality checking for machine learning with MeSQuaL [demonstration paper]</title>
    </titleInfo>
    <name type="personnal">
      <namePart type="family">Comignani</namePart>
      <namePart type="given">U.</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <name type="personnal">
      <namePart type="family">Novelli</namePart>
      <namePart type="given">N.</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <name type="personnal">
      <namePart type="family">Berti-Equille</namePart>
      <namePart type="given">Laure</namePart>
      <role>
        <roleTerm type="text">auteur</roleTerm>
        <roleTerm type="code" authority="marcrelator">aut</roleTerm>
      </role>
      <affiliation>IRD</affiliation>
    </name>
    <typeOfResource>text</typeOfResource>
    <genre authority="local">bookSection</genre>
    <language>
      <languageTerm type="code" authority="iso639-2b">eng</languageTerm>
    </language>
    <physicalDescription>
      <internetMediaType>text/pdf</internetMediaType>
      <digitalOrigin>reformatted digital</digitalOrigin>
      <reformattingQuality>access</reformattingQuality>
    </physicalDescription>
    <abstract>This demo proposes MeSQuaL, a system for profiling and check-ing data quality before further tasks, such as data analytics and machine learning. MeSQuaL extends SQL for querying relational data with constraints on data quality and facilitates the verification of statistical tests. The system includes: (1) a query interpreter for SQuaL, the SQL-extended language we propose for declaring and querying data with data quality checks and statistical tests; (2) an extensible library of user-defined functions for profiling the data and computing various data quality indicators ;and (3) a user interface for declaring data quality constraints, profiling data, monitoring data quality with SQuaL queries, and visualizing the results via data quality dashboards. We showcaseour system in action with various scenarios on real-world datasets and show its usability for monitoring data quality over timeand checking the quality of data on-demand.</abstract>
    <targetAudience authority="marctarget">specialized</targetAudience>
    <classification authority="local">122</classification>
    <relatedItem type="host">
      <titleInfo>
        <title>Advances in database technology : EDBT 2020</title>
      </titleInfo>
      <name type="personnal">
        <namePart type="family">Bonifati</namePart>
        <namePart type="given">A.</namePart>
        <role>
          <roleTerm type="text">ed.</roleTerm>
          <roleTerm type="code" authority="marcrelator">edt</roleTerm>
        </role>
        <affiliation>IRD</affiliation>
      </name>
      <name type="personnal">
        <namePart type="family">Zhou</namePart>
        <namePart type="given">I.</namePart>
        <role>
          <roleTerm type="text">ed.</roleTerm>
          <roleTerm type="code" authority="marcrelator">edt</roleTerm>
        </role>
        <affiliation>IRD</affiliation>
      </name>
      <name type="personnal">
        <namePart type="family">Vaz Salles</namePart>
        <namePart type="given">M. A.</namePart>
        <role>
          <roleTerm type="text">ed.</roleTerm>
          <roleTerm type="code" authority="marcrelator">edt</roleTerm>
        </role>
        <affiliation>IRD</affiliation>
      </name>
      <name type="personnal">
        <namePart type="family">Böhm</namePart>
        <namePart type="given">A.</namePart>
        <role>
          <roleTerm type="text">ed.</roleTerm>
          <roleTerm type="code" authority="marcrelator">edt</roleTerm>
        </role>
        <affiliation>IRD</affiliation>
      </name>
      <name type="personnal">
        <namePart type="family">Olteanu</namePart>
        <namePart type="given">D.</namePart>
        <role>
          <roleTerm type="text">ed.</roleTerm>
          <roleTerm type="code" authority="marcrelator">edt</roleTerm>
        </role>
        <affiliation>IRD</affiliation>
      </name>
      <name type="personnal">
        <namePart type="family">Fletcher</namePart>
        <namePart type="given">G.</namePart>
        <role>
          <roleTerm type="text">ed.</roleTerm>
          <roleTerm type="code" authority="marcrelator">edt</roleTerm>
        </role>
        <affiliation>IRD</affiliation>
      </name>
      <name type="personnal">
        <namePart type="family">Khan</namePart>
        <namePart type="given">A.</namePart>
        <role>
          <roleTerm type="text">ed.</roleTerm>
          <roleTerm type="code" authority="marcrelator">edt</roleTerm>
        </role>
        <affiliation>IRD</affiliation>
      </name>
      <name type="personnal">
        <namePart type="family">Yang</namePart>
        <namePart type="given">B.</namePart>
        <role>
          <roleTerm type="text">ed.</roleTerm>
          <roleTerm type="code" authority="marcrelator">edt</roleTerm>
        </role>
        <affiliation>IRD</affiliation>
      </name>
      <part>
        <detail type="volume">
          <number>23</number>
        </detail>
        <extent unit="pages">
          <list>  591-594</list>
        </extent>
      </part>
      <originInfo>
        <place type="text">
          <placeTerm>Constance</placeTerm>
        </place>
        <publisher>Open Proceedings</publisher>
        <dateIssued key="date">2020</dateIssued>
      </originInfo>
      <name type="conference">
        <namePart>International Conference on Extending Database Technology, 23., Copenhague (DNK), 2020/30/03-2020/04/02</namePart>
      </name>
    </relatedItem>
    <relatedItem type="series">
      <titleInfo>
        <title>Open Proceedings</title>
        <partNumber>23</partNumber>
      </titleInfo>
    </relatedItem>
    <identifier type="uri">https://www.documentation.ird.fr/hor/fdi:010078830</identifier>
    <identifier type="doi">10.5441/002/edbt.2020.71</identifier>
    <identifier type="isbn">978-3-89318-083-7</identifier>
    <location>
      <shelfLocator>[F B010078830]</shelfLocator>
      <url usage="primary display" access="object in context">https://www.documentation.ird.fr/hor/fdi:010078830</url>
      <url access="row object">https://horizon.documentation.ird.fr/exl-doc/pleins_textes/divers20-07/010078830.pdf</url>
    </location>
    <recordInfo>
      <recordContentSource>IRD - Base Horizon / Pleins textes</recordContentSource>
      <recordCreationDate encoding="w3cdtf">2020-07-20</recordCreationDate>
      <recordChangeDate encoding="w3cdtf">2020-07-31</recordChangeDate>
      <recordIdentifier>fdi:010078830</recordIdentifier>
      <languageOfCataloging>
        <languageTerm authority="iso639-2b">fre</languageTerm>
      </languageOfCataloging>
    </recordInfo>
  </mods>
</modsCollection>
