• Login
     
    View Item 
    •   ORFEO Home
    • Royal Belgian Institute for Space Aeronomy
    • BIRA-IASB publications
    • View Item
    •   ORFEO Home
    • Royal Belgian Institute for Space Aeronomy
    • BIRA-IASB publications
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Machine learning for automatic identification of new minor species

    Thumbnail
    View/Open
    Schmidt(2021a).pdf (1.613Mb)
    Authors
    Schmidt, F.
    Cruz Mermy, G.
    Erwin, J.
    Robert, S.
    Neary, L.
    Thomas, I.R.
    Daerden, F.
    Ristic, B.
    Patel, M.R.
    Bellucci, G.
    Lopez-Moreno, J.-J.
    Vandaele, A.-C.
    Show allShow less
    Discipline
    Physical sciences
    Subject
    Spectroscopy
    Atmosphere
    Data mining
    Machine learning
    Unsupervised
    Source separation
    Non-negative matrix factorization
    Audience
    Scientific
    Date
    2021
    Metadata
    Show full item record
    Description
    One of the main difficulties to analyze modern spectroscopic datasets is due to the large amount of data. For example, in atmospheric transmittance spectroscopy, the solar occultation channel (SO) of the NOMAD instrument onboard the ESA ExoMars2016 satellite called Trace Gas Orbiter (TGO) had produced  ~ 10 millions of spectra in  ~ 20000 acquisition sequences since the beginning of the mission in April 2018 until 15 January 2020. Other datasets are even larger with  ~ billions of spectra for OMEGA onboard Mars Express or CRISM onboard Mars Reconnaissance Orbiter. Usually, new lines are discovered after a long iterative process of model fitting and manual residual analysis. Here we propose a new method based on unsupervised machine learning, to automatically detect new minor species. Although precise quantification is out of scope, this tool can also be used to quickly summarize the dataset, by giving few endmembers (”source”) and their abundances. The methodology is the following: we proposed a way to approximate the dataset non-linearity by a linear mixture of abundance and source spectra (endmembers). We used unsupervised source separation in form of non-negative matrix factorization to estimate those quantities. Several methods are tested on synthetic and simulation data. Our approach is dedicated to detect minor species spectra rather than precisely quantifying them. On synthetic example, this approach is able to detect chemical compounds present in form of 100 hidden spectra out of 104, at 1.5 times the noise level. Results on simulated spectra of NOMAD-SO targeting CH4 show that detection limits goes in the range of 100–500 ppt in favorable conditions. Results on real martian data from NOMAD-SO show that CO2 and H2O are present, as expected, but CH4 is absent. Nevertheless, we confirm a set of new unexpected lines in the database, attributed by ACS instrument Team to the CO2 magnetic dipole.
    Citation
    Schmidt, F.; Cruz Mermy, G.; Erwin, J.; Robert, S.; Neary, L.; Thomas, I.R.; Daerden, F.; Ristic, B.; Patel, M.R.; Bellucci, G.; Lopez-Moreno, J.-J.; Vandaele, A.-C. (2021). Machine learning for automatic identification of new minor species. , Journal of Quantitative Spectroscopy and Radiative Transfer, Vol. 259, A107361, DOI: 10.1016/j.jqsrt.2020.107361.
    Identifiers
    uri: https://orfeo.belnet.be/handle/internal/7641
    doi: http://dx.doi.org/10.1016/j.jqsrt.2020.107361
    Type
    Article
    Peer-Review
    Yes
    Language
    eng
    Links
    NewsHelpdeskBELSPO OA Policy

    Browse

    All of ORFEOCommunities & CollectionsBy Issue DateAuthorsTitlesDisciplinesThis CollectionBy Issue DateAuthorsTitlesDisciplines
     

    DSpace software copyright © 2002-2016  DuraSpace
    Send Feedback | Cookie Information
    Theme by 
    Atmire NV