• xmlui.mirage2.page-structure.header.title
    • français
    • English
  • Help
  • Login
  • Language 
    • Français
    • English
View Item 
  •   BIRD Home
  • LAMSADE (UMR CNRS 7243)
  • LAMSADE : Publications
  • View Item
  •   BIRD Home
  • LAMSADE (UMR CNRS 7243)
  • LAMSADE : Publications
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Browse

BIRDResearch centres & CollectionsBy Issue DateAuthorsTitlesTypeThis CollectionBy Issue DateAuthorsTitlesType

My Account

LoginRegister

Statistics

Most Popular ItemsStatistics by CountryMost Popular Authors
Thumbnail - Request a copy

AS-Index: A Structure For String Search Using n-grams and Algebraic Signatures

du Mouza, Cedric; Litwin, Witold; Rigaux, Philippe; Schwarz, Thomas (2009), AS-Index: A Structure For String Search Using n-grams and Algebraic Signatures, CIKM '09: Conference on Information and Knowledge Management, 2009-11, Hong-Kong, China

Type
Communication / Conférence
Date
2009
Conference title
CIKM '09: Conference on Information and Knowledge Management
Conference date
2009-11
Conference city
Hong-Kong
Conference country
China
Book author
Cheung, David; Song, Il Yeol
Publisher
ACM - Association for Computing Machinery
Published in
New York, NY
ISBN
978-1-60558-512-3
Pages
295-304
Publication identifier
10.1145/1645953.1645993
Metadata
Show full item record
Author(s)
du Mouza, Cedric

Litwin, Witold
Laboratoire d'analyse et modélisation de systèmes pour l'aide à la décision [LAMSADE]
Rigaux, Philippe cc

Schwarz, Thomas
Abstract (EN)
AS-Index is a new index structure for exact string search in disk resident databases. It uses hashing, unlike known alternatives, whether baesd on trees or tries. It typically indexes every n-gram in the database, though non-dense indexing is possible. The hash function uses the algebraic signatures of n-grams. Use of hashing provides for constant index access time for arbitrarily long patterns, unlike other structures whose search cost is at best logarithmic. The storage overhead of AS-Index is basically 500 - 600%, similar to that ofalternatives or smaller. We show the index structure, our use of algebraic signatures and the search algorithm. We present the theoretical and experimental performance analysis. We compare the AS-Index to main alternatives. We conclude that AS-Index is an attractive structure and we indicate directions for future work.
Subjects / Keywords
Full text indexing; Large scale indexing; L'indexation de texte intégral; Indexation de grande échelle

Related items

Showing items related by title and author.

  • Thumbnail
    AS-Index: A Structure For String Search Using n-grams and Algebraic Signatures 
    Constantin, Camelia; du Mouza, Cedric; Litwin, Witold; Rigaux, Philippe; Schwarz, Thomas (2016) Article accepté pour publication ou publié
  • Thumbnail
    AS-Index: A Structure for String Search Using n-Grams and Algebraic Signatures 
    Rigaux, Philippe; Litwin, Witold; du Mouza, Cédric; Schwarz, Thomas (2009) Communication / Conférence
  • Thumbnail
    Fast nGram-Based String Search Over Data Encoded Using Algebraic Signatures 
    Litwin, Witold; Mokadem, Riad; Rigaux, Philippe; Schwartz, Thomas (2007) Communication / Conférence
  • Thumbnail
    Cumulative Algebraic Signatures for Fast String Search, Protection Against Incidental Viewing and Corruption of Data in an SDDS 
    Litwin, Witold; Mokadem, Riad; Schwarz, Thomas (2007) Communication / Conférence
  • Thumbnail
    Dynamic Storage Balancing in a Distributed Spatial Index 
    Rigaux, Philippe; Litwin, Witold; du Mouza, Cédric (2007) Communication / Conférence
Dauphine PSL Bibliothèque logo
Place du Maréchal de Lattre de Tassigny 75775 Paris Cedex 16
Phone: 01 44 05 40 94
Contact
Dauphine PSL logoEQUIS logoCreative Commons logo