Skip to main content

Prometheus Storage

Prometheus Storage

On average, Prometheus uses only around 1-2 bytes per sample. Thus, to plan the capacity of a Prometheus server, you can use the rough formula:
1
needed_disk_space = retention_time_seconds * ingested_samples_per_second * bytes_per_sample

Concepts

  • chunks this data together in chunks of constant size (1024 bytes)
  • keeps all the currently used (incomplete) chunks in memory
  • batch up write operations to store chunks on disk

Optimize

Memory Options

  • storage.local.memory-chunks
how many chunks can Prometheus keep in memory. Remember, it’s the number of chunks, not the size in bytes.
Suggested value:  / 1024 / 6.
  • storage.local.max-chunks-to-persist
how many chunks can be waiting to be written to the disk.
Suggested value: memory-chunks / 2

The rushed mode

When the number of chunks in memory, waiting to be persisted to disk, grows too much, Prometheus enters the rushed mode and speed up persisting chunks.
Prometheus calculate an urgency score, as the number of chunks waiting for persistence in relation to max-chunks-to-persist and on how much the number of chunks in memory exceeds the memory-chunks.
  • urgency_score > 0.8: Enter Rushed Mode
  • urgency_score < 0.7: Leave Rushed Mode

Snapshot

Snapshot API

Prometheus V2.0 提供了snapshot机制用于TSDB Backend数据备份, 创建快照API开启方式: --web.enable-admin-api
1
2
3
4
5
6
7
8
$ curl -XPOST http://localhost:9090/api/v2/admin/tsdb/snapshot
$ curl -X POST http://prometheus.in.dataengine.com/api/v2/admin/tsdb/snapshot
{
"name": "2017-12-05T06:42:03Z-32d47b4db4c3e108"
}
Data Directory
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
data
├── 01C0JGQ905RK2ZZCQS8F7VSNV7
│   ├── chunks
│   ├── index
│   ├── meta.json
│   └── tombstones
├── 01C0JQJTVZ6JC5MQJFSG145979
│   ├── chunks
│   ├── index
│   ├── meta.json
│   └── tombstones
├── lock
├── snapshots
│   ├── 2017-12-05T06:42:03Z-32d47b4db4c3e108
│   └── 2017-12-05T06:42:44Z-6be0c2980c1989c5
└── wal
├── 000001
└── 000002
Snapshot Directory
1
2
3
4
5
6
7
2017-12-05T06:42:44Z-6be0c2980c1989c5
└── 01C0K07PSNB97D6TXJQ7YE9ST1
├── chunks
│   └── 000001
├── index
├── meta.json
└── tombstones
meta.json
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
{
"version": 1,
"ulid": "01C0JPHXJ8BBG194HPNZFP7AR7",
"minTime": 1512446400000,
"maxTime": 1512456122548,
"stats": {
"numSamples": 321304,
"numSeries": 604,
"numChunks": 3020
},
"compaction": {
"level": 1,
"sources": [
"01C0JPHXJ8BBG194HPNZFP7AR7"
]
}
}
  • maxTime - minTime = storage.tsdb.min-block-duration (default: 2h)
  • meta.json 文件存储了 series 起止时间 / metircs 数量 / block 目录名称;
  • 每个snapshot 由一个或多个 Chunk Block 组成, 这些Block 共同存储了Prometheus TSDB 的全量数据;
  • Chunk Block 生成的数量由 storage.tsdb.min-block-duration 和 storage.tsdb.max-block-duration 控制;
1
2
caller=compact.go:361 component=tsdb msg="compact blocks" count=1
mint=1512532800000 maxt=1512539893781
Prometheus Version: 1.7.0
1
time="2017-11-20T03:58:23+08:00" level=error msg="Storage needs throttling. Scrapes and rule evaluations will be skipped." chunksToPersist=92938 memoryChunks=493334 source="storage.go:1007" urgencyScore=1

Refers

Research

  • Prometheus WAL Log
  • Chunk Data
  • Snapshot

Comments

Popular posts from this blog

OWASP Top 10 Threats and Mitigations Exam - Single Select

Last updated 4 Aug 11 Course Title: OWASP Top 10 Threats and Mitigation Exam Questions - Single Select 1) Which of the following consequences is most likely to occur due to an injection attack? Spoofing Cross-site request forgery Denial of service   Correct Insecure direct object references 2) Your application is created using a language that does not support a clear distinction between code and data. Which vulnerability is most likely to occur in your application? Injection   Correct Insecure direct object references Failure to restrict URL access Insufficient transport layer protection 3) Which of the following scenarios is most likely to cause an injection attack? Unvalidated input is embedded in an instruction stream.   Correct Unvalidated input can be distinguished from valid instructions. A Web application does not validate a client’s access to a resource. A Web action performs an operation on behalf of the user without checkin...

CKA Simulator Kubernetes 1.22

  https://killer.sh Pre Setup Once you've gained access to your terminal it might be wise to spend ~1 minute to setup your environment. You could set these: alias k = kubectl                         # will already be pre-configured export do = "--dry-run=client -o yaml"     # k get pod x $do export now = "--force --grace-period 0"   # k delete pod x $now Vim To make vim use 2 spaces for a tab edit ~/.vimrc to contain: set tabstop=2 set expandtab set shiftwidth=2 More setup suggestions are in the tips section .     Question 1 | Contexts Task weight: 1%   You have access to multiple clusters from your main terminal through kubectl contexts. Write all those context names into /opt/course/1/contexts . Next write a command to display the current context into /opt/course/1/context_default_kubectl.sh , the command should use kubectl . Finally write a second command doing the same thing into ...