Date
2020
Dewey
Organisation des données
Sujet
JSON
Conference name
23rd International Conference on Extending Database Technology, EDBT 2020
Conference date
2020
Conference country
DENMARK
Book title
EDBT 2020 - 23nd International Conference on Extending Database Technology
Publisher
Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik
ISBN
978-3-89318-083-7
Author
Baazizi, Mohamed-Amine
Berti, Clément
Colazzo, Dario
Ghelli, Giorgio
Sartiani, Carlo
Type
Communication / Conférence
Item number of pages
635-638
Abstract (EN)
JSON established itself as a popular data format for representing data whose structure is irregular or unknown a priori. JSON collections are usually massive and schema-less. Inferring a schema describing the structure of these collections is crucial for formulating meaningful queries and for adopting schema-based optimizations. In a recent work, we proposed a Map/Reduce schema inference approach that either infers a compact representation of the input collection or a precise description of every possible shape in the data. Since no level of precision is ideal, it is more appealing to give the analyst the freedom of choosing between different levels of precisions in an interactive fashion. In this paper we describe a schema inference system offering this important functionality.