Probabilistic Data Structures and Algorithms for Big Data ApplicationsBoD – Books on Demand, 05.08.2022 - 222 Seiten A technical book about popular space-efficient data structures and fast algorithms that are extremely useful in modern Big Data applications. The purpose of this book is to introduce technology practitioners, including software architects and developers, as well as technology decision makers to probabilistic data structures and algorithms. Reading this book, you will get a theoretical and practical understanding of probabilistic data structures and learn about their common uses. |
Häufige Begriffe und Wortgruppen
approximate Big Data applications binary bit array buffer candidate buckets candidate cluster canonical bucket cardinality estimation centroid compression compute Consider corresponding bits cosine similarity Count Sketch Count-Min Sketch COUNTER[j Counting Bloom filter cryptographic hash cryptographic hash functions Cuckoo filter Cuckoo hashing data stream decrement deletion digest property empty Example false positive fingerprint formula frequencies of elements frequency estimation frequent elements Hamming distance hash collisions hash functions hash table hash values HyperLogLog algorithm increment insert integers Jaccard similarity length Linear probing locality-sensitive hash LogLog algorithm memory merged minwise hashing MurmurHash3 nearest neighbor nearest neighbor search number of elements p-bit permutations probabilistic data structures probability processing q-quantile QF[i quantile value Quotient filter random rank remainder requires sequence signature matrix SimHash signature similarity threshold simple counter space space-efficient standard error T-DIGEST data structure total number update vector zero

