A simple, time-tested family of random hash functions in Python, based on CRC32 and xxHash, affine transformations, and the Mersenne Twister. 🎲
Updated Jun 6, 2022 - Python
A simple, time-tested family of random hash functions in Java, based on CRC32, affine transformations, and the Mersenne Twister. 🎲
Python implementations of the Flajolet-Martin, LogLog, SuperLogLog, and HyperLogLog cardinality estimation algorithms, applied to estimate the number of unique traffic violations in NYC in the 2019 fiscal year
Comparative Analysis of Unsupervised Learning Methods for Real-time Anomaly Detection in Industrial Control Systems (ICS)
Basic implementation of the Bloom filter and Flajolet-Martin algorithms in Python, with hashes and test files
Python implementation of Flajolet-Martin and Bloom Filter algorithms for massive data streams. Features lazy evaluation and strict O(1) RAM constraints.
Simple Projects in Data Mining
Streaming data in Spark and running data analytics on it
Bloom filtering, Flajolet-Martin algorithm, and reservoir sampling
This repository contains the assignments and project code created during Big Data coursework
Coursework for the PhD course DT8116 - Mining of Massive Datasets, NTNU Norway