Thinking by classes in Data Science: the symbolic data analysis paradigm

Diday, Edwin (2016), Thinking by classes in Data Science: the symbolic data analysis paradigm, Wiley Interdisciplinary Reviews. Computational Statistics, 8, 5, p. 172–205. 10.1002/wics.1384

Type
Article accepted for publication or published
Date
2016
Journal name
Wiley Interdisciplinary Reviews. Computational Statistics
Volume
8
Number
5
Publisher
Wiley
Pages
172–205
Publication identifier
10.1002/wics.1384
Author(s)
Diday, Edwin
CEntre de REcherches en MAthématiques de la DEcision [CEREMADE]
Abstract (FR)
Thinking in terms of classes in Data Science
Abstract (EN)
Data Science, considered as a science in its own right, is, in general terms, the extraction of knowledge from data. Symbolic data analysis (SDA) gives a new way of thinking in Data Science by extending the standard input to a set of classes of individual entities. Hence, classes of a given population are considered to be units of a higher-level population to be studied. Such classes often represent the real units of interest. In order to take variability between the members of each class into account, classes are described by intervals, distributions, sets of categories or numbers (sometimes weighted), and the like. In that way, we obtain new kinds of data, called ‘symbolic’ as they cannot be reduced to numbers without losing much information. The first step in SDA is to build the symbolic data table, where the rows are classes and the variables can take symbolic values. The second step is to study and extract new knowledge from these new kinds of data by, at the least, an extension of Computer Statistics and Data Mining to symbolic data. SDA is a new paradigm that opens up a vast domain of research and applications by giving results complementary to those of classical methods applied to standard data. SDA also gives answers to big data and complex data challenges, as big data can be reduced and summarized by classes, and complex data with multiple unstructured data tables and unpaired variables can be transformed into a structured data table with paired symbolic‐valued variables.
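As an illustration of the first step described in the abstract (building a symbolic data table whose rows are classes), the sketch below aggregates individual records into one description per class. It is a minimal sketch and not the author's method: the records, the class labels, the variable names, and the function `build_symbolic_table` are all hypothetical, and the choice of an interval for the numeric variable and a relative-frequency distribution for the categorical one is only one of the representations the abstract lists.

```python
from collections import Counter, defaultdict

# Hypothetical individual-level records: (class label, age, blood type).
individuals = [
    ("team_a", 23, "O"),
    ("team_a", 31, "A"),
    ("team_a", 27, "O"),
    ("team_b", 45, "B"),
    ("team_b", 39, "B"),
]


def build_symbolic_table(records):
    """Aggregate individuals into one symbolic description per class."""
    # Group the individual records by their class label.
    groups = defaultdict(list)
    for label, age, blood in records:
        groups[label].append((age, blood))

    table = {}
    for label, members in groups.items():
        ages = [age for age, _ in members]
        counts = Counter(blood for _, blood in members)
        table[label] = {
            # Interval-valued variable: observed range within the class.
            "age_interval": (min(ages), max(ages)),
            # Distribution-valued variable: relative frequency per category.
            "blood_distribution": {
                category: n / len(members) for category, n in counts.items()
            },
            # Class size, kept so the class can be weighted later.
            "size": len(members),
        }
    return table


symbolic_table = build_symbolic_table(individuals)
for label, description in symbolic_table.items():
    print(label, description)
```

Each row of `symbolic_table` keeps the within-class variability (a range and a distribution) that a single mean or mode per class would discard.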
Subjects / Keywords
data science; data mining; classification; learning; symbolic data analysis; functional analysis; Bayesian; multilevel analysis; complex data; big data; granular computing; compositional data; corpus-based machine learning

Related items

Showing items related by title and author.

  • Mixture decomposition of distributions by copulas in the symbolic data analysis framework
    Vrac, Mathieu; Diday, Edwin (2005) Article accepted for publication or published
  • Strategies evaluation in environmental conditions by symbolic data analysis: application in medicine and epidemiology to trachoma
    Guinot, Christiane; Malvy, Denis; Schémann, Jean-François; Afonso, Filipe; Haddad, Raja; Diday, Edwin (2015) Article accepted for publication or published
  • A Generalisation of the Mixture Decomposition Problem in the Symbolic Data Analysis Framework
    Diday, Edwin (2001) Working paper
  • From the statistics of data to the statistics of knowledge: Symbolic data analysis.
    Billard, Lynne; Diday, Edwin (2003) Article accepted for publication or published
  • Data analysis and informatics. Proceedings of the Second international Symposium on Data Analysis and Informatics, organised by the Institut de Recherche d'Informatique et d'automatique, Versailles, October 17-19, 1979.
    Tomassone, R.; Pagès, J.P.; Lebart, Ludovic; Diday, Edwin (1979-10) Book