Popularity

4.5

Growing

Activity

4.9

Stars 468

Watchers 15

Forks 60

Last Commit 23 days ago

Description

Bloom filter is a space-efficient probabilistic data structure that is used to test whether an element is a member of a set. False positives are possible, but false negatives are not. For more detail, check the wikipedia article. Instead of using k different hash functions, this implementation seeds the CRC32 hash with k different initial values (0, 1, ..., k-1). This may or may not give you a good distribution, it all depends on the data.

Performance of the Bloom filter depends on a number of variables:

Code Quality Rank: L5

Monthly Downloads: 3,742

Programming language: C

Tags: Scientific Specific Projects

Latest version: v2.1.1

bloomfilter-rb alternatives and similar gems

Based on the "Specific" category.
Alternatively, view bloomfilter-rb alternatives based on common mentions on social networks and blogs.

decisiontree

6.7 0.0 L5 bloomfilter-rb VS decisiontree

ID3-based implementation of the ML Decision Tree algorithm
BioRuby

4.7 6.9 L2 bloomfilter-rb VS BioRuby

bioruby

WorkOS - The modern identity platform for B2B SaaS

The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

Promo workos.com

Sysrandom

2.0 0.0 L4 bloomfilter-rb VS Sysrandom

Secure random number generation for Ruby using system RNG facilities

* Code Quality Rankings and insights are calculated and provided by Lumnify.
They vary from L1 to L5 with "L5" being the highest.

Do you think we are missing an alternative of bloomfilter-rb or a related project?

Add another 'Specific' Gem

Popular Comparisons

README

BloomFilter(s) in Ruby

Native (MRI/C) counting bloom filter
Redis-backed getbit/setbit non-counting bloom filter
Redis-backed set-based counting (+TTL) bloom filter

Performance of the Bloom filter depends on a number of variables:

size of the bit array
size of the counter bucket
number of hash functions

Resources

Determining parameters: Scalable Datasets: Bloom Filters in Ruby
Applications & reasons behind bloom filter: Flow analysis: Time based bloom filter

MRI/C API Example

MRI/C implementation which creates an in-memory filter which can be saved and reloaded from disk.

require 'bloomfilter-rb'

bf = BloomFilter::Native.new(:size => 100, :hashes => 2, :seed => 1, :bucket => 3, :raise => false)
bf.insert("test")
bf.include?("test")     # => true
bf.include?("blah")     # => false

bf.delete("test")
bf.include?("test")     # => false

# Hash with a bloom filter!
bf["test2"] = "bar"
bf["test2"]             # => true
bf["test3"]             # => false

bf.stats
# => Number of filter bits (m): 10
# => Number of filter elements (n): 2
# => Number of filter hashes (k) : 2
# => Predicted false positive rate = 10.87%

Redis-backed setbit/getbit bloom filter

Uses getbit/setbit on Redis strings - efficient, fast, can be shared by multiple/concurrent processes.

bf = BloomFilter::Redis.new

bf.insert('test')
bf.include?('test')     # => true
bf.include?('blah')     # => false

bf.delete('test')
bf.include?('test')     # => false

Memory footprint

1.0% error rate for 1M items, 10 bits/item: 2.5 mb
1.0% error rate for 150M items, 10 bits per item: 358.52 mb
0.1% error rate for 150M items, 15 bits per item: 537.33 mb

Redis-backed counting bloom filter with TTLs

Uses regular Redis get/set counters to implement a counting filter with optional TTL expiry. Because each "bit" requires its own key in Redis, you do incur a much larger memory overhead.

bf = BloomFilter::CountingRedis.new(:ttl => 2)

bf.insert('test')
bf.include?('test')     # => true

sleep(2)
bf.include?('test')     # => false

Credits

Tatsuya Mori [email protected] (Original C implementation: http://vald.x0.com/sb/)

License

*Note that all licence references and agreements mentioned in the bloomfilter-rb README section above are relevant to that project's source code only.

bloomfilter-rb

BloomFilter(s) in Ruby: Native counting filter + Redis counting/non-counting filters