InfluxDB Internals

InfluxDB Internals Platform Engineering Team  @ryanbetts / ryan@influxdata.com

How great are databases?
• I like making things with smart, clever, kind people.
• I’ve been working on high-throughput, realtime data for the last 10 years.

• What’s so special about time series
• Time series database designs
• InfluxDB internals

               RDBMS   NoSQL             TSDB
Correctness    ACID    BASE              BASE
Schema         DDL     DDL / documents   on-write
Writing data   DML     POST/PUT          line protocol
Reading data   SQL     GET + filter      filter, window, group, join

TSDB unique combination
• Ingest: thousands to millions of points per second
• Store: fast-accumulating, append-mostly data, lots of repetition, often with time-to-live
• Query: analytic queries with fast filtering, windowing
• Scale: availability, storage, query

Facebook Gorilla
• TTL eviction
• Columnar compression
• Write availability > query correctness
• Metric-based schema
• Separate query processing from access-path
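To make the "columnar compression" bullet concrete, here is a minimal sketch of delta-of-delta timestamp encoding, the technique Gorilla is known for applying to its timestamp column. It assumes integer timestamps and leaves out the variable-width bit packing that produces the actual space savings; the function names are illustrative only.

```python
def encode_delta_of_delta(timestamps):
    """Store the first timestamp, then the change in the interval between points."""
    if not timestamps:
        return []
    encoded = [timestamps[0]]
    prev, prev_delta = timestamps[0], 0
    for t in timestamps[1:]:
        delta = t - prev
        encoded.append(delta - prev_delta)  # near zero for regularly spaced points
        prev, prev_delta = t, delta
    return encoded


def decode_delta_of_delta(encoded):
    """Invert encode_delta_of_delta by re-accumulating the deltas."""
    if not encoded:
        return []
    decoded = [encoded[0]]
    delta = 0
    for dod in encoded[1:]:
        delta += dod
        decoded.append(decoded[-1] + delta)
    return decoded


timestamps = [1000, 1060, 1120, 1180, 1241, 1300]
encoded = encode_delta_of_delta(timestamps)
print(encoded)  # [1000, 60, 0, 0, 1, -2]
```

Points that arrive at a steady interval collapse to runs of zeros, which is why this layout suits append-mostly metric data.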

Druid
• Roll-up at ingest
• Columnar storage & time-based segments
• Indexes on dimension for fast filtering
• Separation of real time and historical data nodes
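Roll-up at ingest can be pictured as pre-aggregating raw events into time buckets keyed by their dimensions. The sketch below is not Druid's implementation; the event shape, the `granularity` parameter, and the count/sum aggregates are assumptions chosen for illustration.

```python
from collections import defaultdict


def roll_up(events, granularity):
    """Aggregate (timestamp, dimensions, value) events into per-bucket count and sum."""
    buckets = defaultdict(lambda: {"count": 0, "sum": 0.0})
    for ts, dims, value in events:
        bucket_start = ts - ts % granularity  # truncate to the bucket boundary
        key = (bucket_start, tuple(sorted(dims.items())))
        buckets[key]["count"] += 1
        buckets[key]["sum"] += value
    return dict(buckets)


events = [
    (0, {"page": "a"}, 1.0),
    (30, {"page": "a"}, 2.0),
    (70, {"page": "a"}, 4.0),
    (10, {"page": "b"}, 8.0),
]
rolled = roll_up(events, granularity=60)
print(len(rolled))  # 3 rows stored instead of 4 raw events
```

The trade-off is that individual events can no longer be recovered once they have been folded into a bucket.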

Bullet Journals
• Fast event recording
• Ordered by time
• Indexed by dimensions
• Weekly / Monthly roll-up

InfluxDB
1. Write Path
2. Storage
3. Query Path
4. Clustering

InfluxDB: Adding data (1)
curl -XPOST 'http://localhost:8086/write?db=mydb' --data-binary 'cpu_load_short,host=server01,region=us-west value=0.64 1434055562000000000'
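The body of that request is a single line-protocol point: measurement and tags, then fields, then a nanosecond timestamp. As a rough sketch, the helper below (a hypothetical name, not part of any InfluxDB client) builds the same line from its parts. It assumes simple numeric field values and does no escaping of spaces, commas, or quotes.

```python
def to_line_protocol(measurement, tags, fields, timestamp_ns):
    """Format one point as: measurement,tag=val,... field=val,... timestamp."""
    # Sorting tags by key is a choice made for this sketch.
    tag_part = "".join(f",{k}={v}" for k, v in sorted(tags.items()))
    field_part = ",".join(f"{k}={v}" for k, v in fields.items())
    return f"{measurement}{tag_part} {field_part} {timestamp_ns}"


line = to_line_protocol(
    "cpu_load_short",
    {"host": "server01", "region": "us-west"},
    {"value": 0.64},
    1434055562000000000,
)
print(line)
```

Because tags and fields travel with every point, the schema is effectively defined on write rather than declared up front.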

InfluxDB: Adding data (2)
• fsync() batch to WAL
• Add to in-memory cache
• Snapshot cache to TSM
• Add to index
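The four steps can be modeled with a toy shard. This is a sketch of the general WAL, cache, snapshot pattern and not InfluxDB's code: the class name, the JSON file formats, and the `snapshot_threshold` knob are all assumptions, and a real TSM file is a compressed columnar format rather than JSON lines.

```python
import json
import os
import tempfile


class TinyShard:
    """Toy write path: WAL -> in-memory cache -> sorted snapshot file."""

    def __init__(self, directory, snapshot_threshold=3):
        self.directory = directory
        self.snapshot_threshold = snapshot_threshold  # hypothetical tuning knob
        self.wal = open(os.path.join(directory, "wal.log"), "a")
        self.cache = {}       # series key -> list of (timestamp, value)
        self.index = set()    # series keys seen so far
        self.snapshots = []   # paths of immutable snapshot files
        self.cached_points = 0

    def write_batch(self, points):
        # 1. Append the whole batch to the WAL and fsync before acknowledging.
        for point in points:
            self.wal.write(json.dumps(point) + "\n")
        self.wal.flush()
        os.fsync(self.wal.fileno())
        # 2. Add the points to the in-memory cache, and new series to the index.
        for series, ts, value in points:
            self.cache.setdefault(series, []).append((ts, value))
            self.index.add(series)
            self.cached_points += 1
        # 3. Snapshot the cache to disk once it grows past the threshold.
        if self.cached_points >= self.snapshot_threshold:
            self.snapshot()

    def snapshot(self):
        path = os.path.join(self.directory, f"{len(self.snapshots) + 1:04d}.snap")
        with open(path, "w") as f:
            for series in sorted(self.cache):
                record = {"series": series, "points": sorted(self.cache[series])}
                f.write(json.dumps(record) + "\n")
            f.flush()
            os.fsync(f.fileno())
        self.snapshots.append(path)
        self.cache.clear()
        self.cached_points = 0
        self.wal.truncate(0)  # the snapshot now holds everything the WAL covered


with tempfile.TemporaryDirectory() as d:
    shard = TinyShard(d, snapshot_threshold=3)
    shard.write_batch([("cpu,host=a", 2, 0.5), ("cpu,host=a", 1, 0.4)])
    shard.write_batch([("cpu,host=b", 1, 0.9)])
    snapshot_count = len(shard.snapshots)
    cached_after = shard.cached_points
    shard.wal.close()
```

The fsync on the WAL is what makes a write durable; the cache exists so that recent points are queryable before they reach a snapshot file.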

InfluxDB: on-disk (filesystem)
CREATE RETENTION POLICY <retention_policy_name> ON <database_name> DURATION <duration> REPLICATION <n> [SHARD DURATION <duration>] [DEFAULT]
• Database directory: /db
• Retention Policy directory: /db/rp
• Shard Group (time bounded) (logical)
• Shard directory: (db/rp/Id#)
• TSM0001.tsm (data file)
• TSM0002.tsm (data file)
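Since shard groups are time bounded, each point lands in the group whose window contains its timestamp. A minimal sketch of that mapping follows, assuming windows are aligned to the Unix epoch and sized by the retention policy's SHARD DURATION; the function name is hypothetical.

```python
from datetime import datetime, timedelta, timezone

EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)


def shard_group_bounds(ts, shard_duration):
    """Return the [start, end) window containing ts, aligned to the epoch."""
    n = (ts - EPOCH) // shard_duration  # whole windows elapsed since the epoch
    start = EPOCH + n * shard_duration
    return start, start + shard_duration


# 1434055562000000000 ns from the write example is 2015-06-11T20:46:02Z.
ts = datetime(2015, 6, 11, 20, 46, 2, tzinfo=timezone.utc)
start, end = shard_group_bounds(ts, timedelta(days=1))
print(start.isoformat(), end.isoformat())
```

Bounding shards by time is what makes retention cheap: expiring old data amounts to dropping whole shard directories.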

InfluxDB: Adding data (DB)
• fsync() batch to WAL
• Add to in-memory cache
• Snapshot to TSM
• Add to index

InfluxDB: Adding data (index)
• Measurement name -> field keys
• Measurement name -> series
• Measurement name -> tag keys -> tag value -> series
• Series -> shards
• (Also sketches of series and measurements for fast cardinality estimation)
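The first three mappings can be sketched as nested dictionaries of sets. This is an illustrative in-memory model only: the class and method names are assumptions, and it omits the series -> shards mapping and the cardinality sketches.

```python
from collections import defaultdict


class SeriesIndex:
    """Toy inverted index from measurements and tags to series keys."""

    def __init__(self):
        self.fields = defaultdict(set)  # measurement -> field keys
        self.series = defaultdict(set)  # measurement -> series keys
        # measurement -> tag key -> tag value -> series keys
        self.tags = defaultdict(lambda: defaultdict(lambda: defaultdict(set)))

    def add(self, measurement, tags, field_keys):
        key = measurement + "".join(f",{k}={v}" for k, v in sorted(tags.items()))
        self.series[measurement].add(key)
        self.fields[measurement].update(field_keys)
        for tag_key, tag_value in tags.items():
            self.tags[measurement][tag_key][tag_value].add(key)
        return key

    def lookup(self, measurement, **tag_filters):
        """Return the series of a measurement that match every tag filter."""
        result = set(self.series.get(measurement, ()))
        for tag_key, tag_value in tag_filters.items():
            by_key = self.tags.get(measurement, {}).get(tag_key, {})
            result &= by_key.get(tag_value, set())
        return result


idx = SeriesIndex()
idx.add("cpu", {"host": "serverA", "region": "us-west"}, ["user", "system"])
idx.add("cpu", {"host": "serverB", "region": "us-west"}, ["user"])
matches = idx.lookup("cpu", host="serverA")
print(matches)
```

A filter such as host = 'serverA' becomes a set intersection, so the cost depends on the number of matching series rather than the number of stored points.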

InfluxDB: TSI
• Roaring bitmaps to short-cut series creation on insert
• Iterators for index mappings
• Index is per-shard; series ID file is per-database
• Partitioned for lock-splitting

InfluxDB: InfluxQL Queries
1. Parse the time range and expressions for filtering data
2. Look up the shards to access using the list of measurements and the time frame
3. Create the iterators for each shard
4. Merge the shard iterator outputs
select user, system from cpu where time > now() - 1h and host = 'serverA'
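Steps 2 through 4 can be sketched with generators and a k-way merge. The sketch assumes the query is already parsed into a lower time bound and a host filter, that each shard is a (start, end, rows) tuple, and that rows within a shard are sorted by time; none of these shapes come from InfluxDB itself.

```python
import heapq


def shard_iterator(rows, t_min, host):
    """Yield matching (time, host, user, system) rows from one shard."""
    for row in rows:
        if row[0] > t_min and row[1] == host:
            yield row


def run_query(shards, t_min, host):
    # Step 2: keep only shards whose time window can contain matching points.
    selected = [rows for _start, end, rows in shards if end > t_min]
    # Step 3: create one lazy iterator per selected shard.
    iterators = [shard_iterator(rows, t_min, host) for rows in selected]
    # Step 4: merge the time-sorted iterators into one time-sorted stream.
    return [(t, user, system) for t, _host, user, system in heapq.merge(*iterators)]


shards = [
    (0, 100, [(10, "serverA", 1.0, 0.5), (60, "serverA", 1.1, 0.4)]),
    (100, 200, [(120, "serverA", 1.5, 0.7), (150, "serverB", 2.0, 0.1)]),
    (200, 300, [(210, "serverA", 1.2, 0.6)]),
]
rows = run_query(shards, t_min=40, host="serverA")
print(rows)  # [(60, 1.1, 0.4), (120, 1.5, 0.7), (210, 1.2, 0.6)]
```

Because every iterator is lazy, the merge only pulls as many rows from each shard as the consumer asks for.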

InfluxDB: Query with IFQL
1. Stand-alone `ifqld` coordinator nodes
2. Streaming storage iterators that support rate-limits
3. Separation of query planning and query distribution
4. Extensible, functional language
5. Unification of InfluxQL and TICKscript

A brief sidebar on append-mostly databases
No one tells you about:
• Wrong data
• Old (back-filled) data

InfluxDB Clustering
• Strongly consistent meta-cluster (based on Raft)
• User-configured replication factor
• Replication- and shard-aware query planner
• Hinted-handoff queues on each data node
• (WIP) Anti-entropy consistency repair
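Hinted handoff can be illustrated with a small model: when a replica is unreachable, the node handling the write queues the point and replays it once the replica is back. The classes below are hypothetical and keep everything in memory, whereas a real queue would be durable on disk.

```python
from collections import deque


class DataNode:
    def __init__(self, name):
        self.name = name
        self.up = True
        self.points = []

    def write(self, point):
        if not self.up:
            raise ConnectionError(f"{self.name} is unreachable")
        self.points.append(point)


class Coordinator:
    """Writes each point to every shard owner, queueing hints for down nodes."""

    def __init__(self, owners):
        self.owners = owners  # one owner per replica (the replication factor)
        self.hints = {node.name: deque() for node in owners}

    def write(self, point):
        acked = 0
        for node in self.owners:
            try:
                node.write(point)
                acked += 1
            except ConnectionError:
                self.hints[node.name].append(point)  # hand off later
        return acked

    def replay(self):
        for node in self.owners:
            queue = self.hints[node.name]
            while queue and node.up:
                node.write(queue[0])
                queue.popleft()  # drop the hint only after a successful write


a, b = DataNode("a"), DataNode("b")
coordinator = Coordinator([a, b])
b.up = False
acked = coordinator.write(("cpu,host=server01", 1, 0.64))
b.up = True
coordinator.replay()
print(acked, b.points)
```

Hints cover short outages only; replicas that diverge for other reasons are what the anti-entropy repair item is meant to address.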

Conclusions
• Time series data has unique storage and query requirements that impact database design.
• Evolution of InfluxDB:
1. TSI: remove the in-memory size limit on cardinality.
2. IFQL: faster feature velocity; safer execution.
3. Anti-entropy repair: easier, more robust scale-out.
