ChaosSearch was built for live analytics at scale on cloud storage. Our architecture was designed for high volume ingestion of streams & analytics at scale via ElasticSearch & Trino API via a stateless fabric that can scale to meet the customers’ scale & latency requirements. Because we don’t store any data, under the hood, ChaosSearch is basically a set of containers that are deployed in cloud compute instances in a dedicated VPC to each customer managed by ChaosSearch. There are two types of containers run:
ChaosSearch has two pricing models (ingest & worker-based) which can be used to optimize pricing based on customers’ preferences.
The ingest-based pricing model is easier to compare with alternative managed services in the observability space (as they typically price based on the same/similar metric) & allows customers to easily estimate their cost based on their ingest. With the ingest-based pricing model, the customer pays $3,000/tenant-region (i.e. # of regions for each tenant) & commits to a certain level of ingestion (e.g. 10,000GB/day) at a certain committed price (e.g. $0.21/GB for 10,000GB/day). Then monthly, if the average daily ingest is above the committed rate there’s an on-demand fee charged at 30% above the committed rate (e.g. $0.27/GB). This pricing model is good for customers that want a managed service for log analytics with predictability of spend based on volume of data at a fraction of the cost of alternatives in the market. It provides the ability to have unlimited retention at a fraction of the cost of ELK-based companies like Logz.io or Mezmo ($0.80-$1.80/GB+ depending on retention), Datadog ($0.10/GB + $1.06-$2.50/M events), Splunk ($2.20/GB), Sumo Logic ($3/GB), which allows you to either replace or complement them in your stack.
The worker-based pricing provides greater flexibility and allows customers to only pay for the compute resources they use with querying available either via Elasticsearch API & Opensearch Dashboards & Trino API & Superset. With the worker-based pricing model, the customer pays $1,000/tenant & a price for the number of workerhours used (i.e. $0.20/workerhour in US regions & up to 50% above that for other cloud regions with a minimum of 12 workers available at all times per region). Each tenant requires a certain number of workers always up to continuously ingest data & can scale up the number of workers (either based on time of day or login policy) to service querying needs (each worker is used to fetch data from cloud storage on each query in a distributed system fashion). Given that ChaosSearch’s ingest is highly efficient, for a single stream (i.e. single object group) with relatively tight schema & constant well-sized file throughput, 24 workers (i.e. $4,500/mo) can ingest up to 5,000GB/day, being very efficient for ingestion of high volumes of data. Adequate number of workers per environment will depend on specifics of data ingestion & query access patterns so it’s better assessed in ChaosSearch’s free proof-of-value (POV).
Single tenant with single us-east-1 region and a stream of VPC flow logs with 20,000GB/day and infrequent querying
Single tenant with single us-east-1 region with multiple streams with large nested schema spikey ingest at 1,000GB/day with continuous & spikey querying & alerting activity with multiple users
If you want a log analytics managed service for observability, our ingest-based pricing model can give you significant savings vs. alternatives. If your use case has high ingest with spikey access, our worker-based pricing model allows you to take full advantage of our architecture & scale workers to meet your needs with superior cost economics, unleashing access to high volumes of data in near real-time across access modes (Elasticsearch & Trino API) at a fraction of the cost of alternatives in the market.