Apache Accumulo is a sorted, distributed key/value store and is at the core of Sqrrl Enterprise. It handles large amounts of structured, semi-structured, and unstructured data as a robust, scalable, and real-time data storage and retrieval system. Accumulo is inspired by Google’s BigTable paper.
Originally developed by the NSA beginning in 2008, Accumulo is now an open source software project hosted by the Apache Foundation and natively integrates with Apache Hadoop. Accumulo is a low latency, non-relational database and uses Hadoop as its file system for storage.
Apache Accumulo has three unique technical advantages over other comparable non-relational or NoSQL database solutions:
- Security: Fine-grained security controls allow organizations to control data at the cell-level without degrading performance.
- Performance: Accumulo is proven to operate and perform at massive scale (i.e., tens of petabytes of data) with low administrative overhead. Accumulo also features very fast reads and writes (10,000s operations per second per node) to support interactive queries and high throughput.
- Flexibility: Accumulo can easily handle multi-structured and sparse datasets without extensive data modeling.
Sqrrl Enterprise builds on these advantages and offers a number of additional features that make Accumulo easier-to-use, more powerful in terms of search and query capabilities, and even more secure. For more information, please see our Architecture page.