This filters library contains the InfiniFilter paper from SIGMOD 2023 as well as some baselines against which it can be compared. InfiniFilter is a probabilistic data structure that answers set-membership queries (similar to a Bloom filter or a Quotient Filter). The core innovation is that InfiniFilter can dynamically expand along with the data while maintaining good guarantees over performance and the false positive rate. The full paper can be accessed here. A video lecture on InfiniFilter appears on youtube.
Compile the library by running the following command from the root directory of the repository.
javac filters/*.java bitmap_implementations/*.java infiniFilter_experiments/*.java
To reproduce the results of the SIGMOD 2023 paper, run the script run_exps.sh from the root directory of the repository. The script will reproduce Figures 13 to 16 in the paper and name them accordingly: figure13.png to figure.16.png. The raw data used to generate these figures is stored in a group of directories called exp1 to exp6. Running the script again will delete the results of older runs and regenerate them.
The script takes a parameter for power of the maximum number of slots to which it will expand the filter. For example, an argument of 31 means the filter will grow to a size of up to 2^31 slots during each experiment. An argument of 31 should be used to reproduce the paper's results. The default value for this parameter is 24.
./run_exps.sh 31
Running the experiments requires having the Java Development Kit version 11 or above installed. Plotting the figures to illustrate the experimental results requires having Gnuplot installed. Both dependencies can be obtained using the following commands on Ubuntu or equivalents on other systems.
sudo apt install default-jdk
sudo apt-get install gnuplot