Elasticsearch Ecosystem - Part1

Elasticsearch is distributed, search and analysis engine. There are many use cases which are supported by Elasticsearch- Log Analysis, Monitoring of Infrastructures, APM, Security analysis, Business analysis and applications. Data gets ingested into Elasticsearch from various sources. Mainly logs and metrices are ingested into Elasticsearch. We can design ingestion pipeline to process and transform data before it is placed inside Elasticsearch. To enable this, Elasticsearch eco system has various tools which we will discuss.

We have 4 main areas in which tools can be categorized but these are not a hard categories and has overlapping features -
1. Data exporter/shipper - configured at data source
2. Data collector - listening for data from various log sources
3. Data storage - storing/indexing data
4. Data visualization - visualization of data

Data shipper:-

Elastic-beats and Fluent-bit can be used for log shipper. Both are light weight product and be installed as agent and mainly used for transport purposes.
Elastic-beats - use oss distribution if you are not using Elasticsearch licensed product.



Fluent Bit is an open source Log Processor and Forwarder which allows you to collect any data like metrics and logs from different sources, enrich them with filters and send them to multiple destinations. 

Data collector :-

Logstash and Fluentd can be used for logs collector.

Logstash 

works on three components - INPUT plugins, FILTER plugins and OUTPUT plugins.
Use oss distribution if you are not using Elasticsearch licensed product.

Fluentd


Fluentd vs Fluent-bit

Both Logstash and Fluentd will coexist for various use cases for legacy or modern architecture. For kubernetes architecture fluentd is preferred because it has built-in Docker logging driver and parser and due to this, containers do not need an extra agent to push the logs to fluentd.

Data storage :- 

Open Distro for Elasticsearch is open source and maintained by AMAZON.
Elasticsearch has both opensource and licensed product.

Data visualization :-


Kibana used for visualization tool.



Comments

Popular posts from this blog

Python Development Environment : pyenv & VS Code

Open Distro for Elasticsearch - Installation - Part 2

WSL 2 on Windows - LINUX based development environment and Docker