Simon Pasquier
a30348f1a4
discovery: add config label to discovered targets metric ( #4753 )
...
* discovery: add labels to discovered targets metric
Signed-off-by: Simon Pasquier <spasquie@redhat.com>
2018-10-18 16:46:59 +01:00
Callum Styan
9bca041285
WIP: keep track of samples per query, set a max # of samples ( #4513 )
...
* keep track of samples per query, set a max # of samples that can be in
memory at once
Signed-off-by: Callum Styan <callumstyan@gmail.com>
2018-10-02 12:59:19 +01:00
Tom Wilkie
4c52400708
Limit concurrent remote reads. ( #4656 )
...
Signed-off-by: Tom Wilkie <tom.wilkie@gmail.com>
2018-09-25 20:07:34 +01:00
Tom Wilkie
457e4bb58e
Limit the number of samples remote read can return. ( #4532 )
...
* Limit the number of samples remote read can return.
- Return 413 entity too large.
- Limit can be set by a flag. Allow 0 to mean no limit.
- Include limit in error message.
- Set default limit to 50M (* 16 bytes = 800MB).
Signed-off-by: Tom Wilkie <tom.wilkie@gmail.com>
2018-09-05 15:50:50 +02:00
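The behaviour listed in the commit above (reject oversized reads with 413, treat 0 as "no limit", default of 50M samples at 16 bytes each) can be illustrated with a minimal Go sketch. This is not the code from the commit: `checkSampleLimit`, `readHandler`, `statusFor`, and the request path used in the helper are hypothetical names chosen for illustration only.

```go
package main

import (
	"fmt"
	"net/http"
	"net/http/httptest"
)

// bytesPerSample is the per-sample size used in the commit message's estimate.
const bytesPerSample = 16

// checkSampleLimit (hypothetical helper) returns an error when count exceeds
// limit. A limit of 0 disables the check, as the commit message describes.
func checkSampleLimit(count, limit int) error {
	if limit > 0 && count > limit {
		// The limit is included in the error message.
		return fmt.Errorf("exceeded sample limit (%d)", limit)
	}
	return nil
}

// readHandler (hypothetical) pretends a read produced count samples and
// answers 413 Request Entity Too Large when the limit is exceeded.
func readHandler(count, limit int) http.HandlerFunc {
	return func(w http.ResponseWriter, r *http.Request) {
		if err := checkSampleLimit(count, limit); err != nil {
			http.Error(w, err.Error(), http.StatusRequestEntityTooLarge)
			return
		}
		w.WriteHeader(http.StatusOK)
	}
}

// statusFor runs the handler against an in-memory recorder and returns the
// HTTP status code it produced. The path is illustrative.
func statusFor(count, limit int) int {
	rec := httptest.NewRecorder()
	req := httptest.NewRequest(http.MethodPost, "/api/v1/read", nil)
	readHandler(count, limit)(rec, req)
	return rec.Code
}

func main() {
	const defaultLimit = 50000000 // 50M samples
	// 50M samples * 16 bytes = 800,000,000 bytes, i.e. 800MB.
	fmt.Println(defaultLimit*bytesPerSample/1000000, "MB")
	fmt.Println(statusFor(10, 5), statusFor(10, 0))
}
```

The check is kept in its own function so that the "0 means no limit" rule lives in one place and can be tested without an HTTP round trip.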
Chris Marchbanks
63ed9d1b70
Send EndsAt along with alerts ( #4550 )
...
Signed-off-by: Chris Marchbanks <csmarchbanks@gmail.com>
2018-08-28 16:05:00 +01:00
Chris Marchbanks
87f1dad16d
throttle resends of alerts to 1 minute by default ( #4538 )
...
Signed-off-by: Chris Marchbanks <csmarchbanks@gmail.com>
2018-08-27 17:41:42 +01:00
Krasi Georgiev
12fe204ea6
move runtime debug funcs in own package ( #4494 )
...
To make local debugging with `go run` easier, moved all files into a
dedicated package `runtime`.
This allows running prometheus just by using `go run main.go` instead of
passing many files like `go run main.go limits_default.go ...`
Signed-off-by: Krasi Georgiev <kgeorgie@redhat.com>
2018-08-22 13:41:11 +03:00
Simon Pasquier
08c2f50382
Merge pull request #4418 from simonpasquier/log-vm-limits
...
prometheus: log virtual memory limits
2018-08-07 16:27:46 +02:00
Julius Volz
90521a65f8
Remove error return value from NotifyFunc() ( #4459 )
...
It's always nil and we also forgot to check it.
Signed-off-by: Julius Volz <julius.volz@gmail.com>
2018-08-04 21:31:12 +02:00
Ganesh Vernekar
f1db699dff
Persist alert 'for' state across restarts ( #4061 )
...
Signed-off-by: Ganesh Vernekar <cs15btech11018@iith.ac.in>
2018-08-02 11:18:24 +01:00
Simon Pasquier
a94450c288
Fix build for openbsd
...
Signed-off-by: Simon Pasquier <spasquie@redhat.com>
2018-07-31 14:41:30 +02:00
Simon Pasquier
141c188ae6
Enforce conversion for freebsd
...
Signed-off-by: Simon Pasquier <spasquie@redhat.com>
2018-07-26 14:58:56 +02:00
Simon Pasquier
208d21a393
Add comment and print units
...
Signed-off-by: Simon Pasquier <spasquie@redhat.com>
2018-07-26 10:26:58 +02:00
Simon Pasquier
ba22b10113
prometheus: log virtual memory limits
...
Signed-off-by: Simon Pasquier <spasquie@redhat.com>
2018-07-25 15:51:27 +02:00
Julius Volz
03aa3a3de8
main: Improve / clean up error messages ( #4286 )
...
Signed-off-by: Julius Volz <julius.volz@gmail.com>
2018-07-18 09:58:40 +02:00
Brian Brazil
68e8b80ffe
Reorder startup and shutdown to prevent panics. ( #4321 )
...
Start rule manager only after tsdb and config are loaded.
Stop rule manager before tsdb to avoid writing to closed storage.
Wait for any in-progress reloads to complete before shutting
down rule manager, so that rule manager doesn't get updated after
being shut down.
Remove incorrect comment around shutting down query engine.
Log when config reload is completed.
Fixes #4133
Fixes #4262
Signed-off-by: Brian Brazil <brian.brazil@robustperception.io>
2018-07-04 13:41:16 +01:00
Michael Khalil
78e0784d04
return error exit status in prometheus cli ( #4296 )
...
Signed-off-by: mikeykhalil <mikeyfkhalil@gmail.com>
2018-06-21 08:32:26 +01:00
Tom Wilkie
8acad5f3cd
make it compile
...
Signed-off-by: Tom Wilkie <tom.wilkie@gmail.com>
2018-05-24 15:40:24 +01:00
Tom Wilkie
e51d6c4b6c
Make remote flush deadline a command line param.
...
Signed-off-by: Tom Wilkie <tom.wilkie@gmail.com>
2018-05-23 15:06:01 +01:00
Mario Trangoni
464e747f1e
fix some comments typos ( #4059 )
2018-04-08 10:51:54 +01:00
Sneha Inguva
7be846754a
main: actor functionality comments
2018-04-01 11:19:30 -07:00
Marek Siarkowicz
bb86c3f62b
Report internal runtime information on status page ( #3921 )
...
Add information about tsdb, wal and config reload
2018-03-21 16:08:37 +00:00
James Turnbull
ba5273a0ab
Minor edits to help text ( #3990 )
2018-03-20 16:54:36 +00:00
Simon Pasquier
e1fd96db25
cmd: fix help text ( #3989 )
2018-03-20 15:58:19 +00:00
ferhat elmas
ffa673f7d8
General simplifications ( #3887 )
...
Another try as in #1516
2018-02-26 07:58:10 +00:00
Bartek Plotka
93a63ac5fd
api: Added v1/status/flags endpoint. ( #3864 )
...
Endpoint URL: /api/v1/status/flags
Example Output:
```json
{
"status": "success",
"data": {
"alertmanager.notification-queue-capacity": "10000",
"alertmanager.timeout": "10s",
"completion-bash": "false",
"completion-script-bash": "false",
"completion-script-zsh": "false",
"config.file": "my_cool_prometheus.yaml",
"help": "false",
"help-long": "false",
"help-man": "false",
"log.level": "info",
"query.lookback-delta": "5m",
"query.max-concurrency": "20",
"query.timeout": "2m",
"storage.tsdb.max-block-duration": "36h",
"storage.tsdb.min-block-duration": "2h",
"storage.tsdb.no-lockfile": "false",
"storage.tsdb.path": "data/",
"storage.tsdb.retention": "15d",
"version": "false",
"web.console.libraries": "console_libraries",
"web.console.templates": "consoles",
"web.enable-admin-api": "false",
"web.enable-lifecycle": "false",
"web.external-url": "",
"web.listen-address": "0.0.0.0:9090",
"web.max-connections": "512",
"web.read-timeout": "5m",
"web.route-prefix": "/",
"web.user-assets": ""
}
}
```
Signed-off-by: Bartek Plotka <bwplotka@gmail.com>
2018-02-21 08:49:02 +00:00
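A client could consume the response shape shown in the example output above roughly as follows. This is a hedged sketch: `flagsResponse` and `parseFlags` are hypothetical names, and the payload is a shortened copy of the example rather than output captured from a running server.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// flagsResponse mirrors the example output: a status string plus a flat
// map of flag name to flag value, both encoded as strings.
type flagsResponse struct {
	Status string            `json:"status"`
	Data   map[string]string `json:"data"`
}

// parseFlags (hypothetical helper) decodes a /api/v1/status/flags body and
// returns the flag map, rejecting any status other than "success".
func parseFlags(body []byte) (map[string]string, error) {
	var resp flagsResponse
	if err := json.Unmarshal(body, &resp); err != nil {
		return nil, err
	}
	if resp.Status != "success" {
		return nil, fmt.Errorf("unexpected status %q", resp.Status)
	}
	return resp.Data, nil
}

func main() {
	// Shortened copy of the example output from the commit message.
	body := []byte(`{
		"status": "success",
		"data": {
			"query.timeout": "2m",
			"web.listen-address": "0.0.0.0:9090"
		}
	}`)
	flags, err := parseFlags(body)
	if err != nil {
		panic(err)
	}
	fmt.Println(flags["query.timeout"], flags["web.listen-address"])
}
```

Because every value in the example is a JSON string, including booleans and numbers such as `"false"` and `"512"`, a `map[string]string` is enough here and any typed interpretation is left to the caller.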
Fabian Reinartz
7ccd4b39b8
*: implement query params
...
This adds a parameter to the storage selection interface which allows
query engine(s) to pass information about the operations surrounding a
data selection.
This can for example be used by remote storage backends to infer the
correct downsampling aggregates that need to be provided.
2018-02-13 12:17:22 +01:00
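The idea in the commit above, passing information about the surrounding operation down to the storage layer, might look like the following simplified sketch. Every name here (`SelectParams`, its fields, `Querier`, `remoteQuerier`) is an assumption made for illustration and is not taken from the commit; matchers are plain strings to keep the example self-contained.

```go
package main

import "fmt"

// SelectParams is a hypothetical bundle of query context handed to storage.
type SelectParams struct {
	Step int64  // assumed: query resolution step in milliseconds
	Func string // assumed: name of the function wrapping the selector
}

// Querier is a simplified stand-in for a storage selection interface that
// accepts the extra parameter alongside the matchers.
type Querier interface {
	Select(params *SelectParams, matchers ...string) string
}

// remoteQuerier imitates a remote backend that stores downsampled aggregates.
type remoteQuerier struct{}

// Select returns the name of the aggregate this imaginary backend would
// serve, inferred from the surrounding function. With no params, or an
// unrecognised function, it falls back to raw samples.
func (remoteQuerier) Select(params *SelectParams, matchers ...string) string {
	if params == nil {
		return "raw"
	}
	switch params.Func {
	case "max_over_time":
		return "max"
	case "min_over_time":
		return "min"
	case "avg_over_time":
		return "avg"
	default:
		return "raw"
	}
}

func main() {
	var q Querier = remoteQuerier{}
	fmt.Println(q.Select(&SelectParams{Step: 60000, Func: "max_over_time"}, `job="node"`))
	fmt.Println(q.Select(nil, `job="node"`))
}
```

Passing a pointer lets callers that have no surrounding context supply `nil`, in which case this sketch returns raw samples.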
Conor Broderick
5169ccf258
Merge pull request #3724 from simonpasquier/fix-bad-data-error
...
Don't reset FiredAt for inactive alerts
2018-02-01 16:18:09 +00:00
Krasi Georgiev
b75428ec19
rename package retrieve to scrape
...
no functional changes, just renaming retrieval to scrape
2018-02-01 09:55:07 +00:00
Krasi Georgiev
7858745c04
rename structs for consistency
2018-01-30 17:49:05 +00:00
Krasi Georgiev
acc4197098
remove discovery race for the context field
2018-01-29 15:18:07 +00:00
Julien Pivotto
8b20cb1e8d
last config success time gauge: use SetToCurrentTime() ( #3750 )
...
Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>
2018-01-27 07:48:13 +00:00
Simon Pasquier
81c0ab69e0
Don't reset FiredAt for inactive alerts
...
Otherwise AlertManager receives resolved alerts where StartsAt is zero which
fails the validation.
2018-01-22 17:17:33 +01:00
Krasi Georgiev
719c579f7b
refactor main execution reloadReady handling, update some comments
2018-01-17 18:14:24 +00:00
Krasi Georgiev
0eafaf32d3
set the correct config reloading execution for scraper and notifier
2018-01-17 13:06:56 +00:00
Krasi Georgiev
97f0461e29
refactor the config reloading execution
2018-01-17 12:02:13 +00:00
Krasi Georgiev
5260c650ec
use the config hash for the map lookup
2018-01-16 11:10:54 +00:00
Krasi Georgiev
8369826808
comment to rethink the map reference for the notifier discovery
2018-01-16 09:47:53 +00:00
Krasi Georgiev
d12e6f29fc
discovery manager ApplyConfig now takes a direct ServiceDiscoveryConfig so that it can be used for the notify manager
...
reimplement the service discovery for the notify manager
Signed-off-by: Krasi Georgiev <krasi.root@gmail.com>
2018-01-15 13:39:44 +00:00
Goutham Veeramachaneni
35a6ffbaf3
Merge pull request #3587 from krasi-georgiev/web-test-error-check
...
handle web_test webhandler errors.
2018-01-10 22:03:25 +05:30
Brian Brazil
ecc24b554d
Hide block duration flags. ( #3618 )
...
Users are starting to use these mistakenly thinking they'll help
with issues, and thus causing some confusion.
Thus hide them and make it clear that they're only there for testing
reasons.
2017-12-24 12:13:48 +00:00
Krasi Georgiev
c94fa731aa
bypass the proxy for the tests
2017-12-20 18:21:10 +00:00
Krasi Georgiev
ad66476c4f
fix flaky main.go test and simplify a bit
2017-12-19 15:07:49 +00:00
Fabian Reinartz
2881d73ed8
Merge pull request #3362 from krasi-georgiev/discovery-refactoring
...
Decouple the discovery and refactor the retrieval package
2017-12-19 12:56:34 +01:00
Goutham Veeramachaneni
9c9f96b2c0
Merge pull request #3529 from krasi-georgiev/main-integration-test
...
main.go integration test for Startup interrupting.
2017-12-18 22:12:13 -06:00
Krasi Georgiev
587dec9eb9
rebased and resolved conflicts with the new Discovery GUI page
...
Signed-off-by: Krasi Georgiev <krasi.root@gmail.com>
2017-12-18 20:10:03 +00:00
Krasi Georgiev
1ec76d1950
rearrange the context variables and logic
...
split the groupsMerge function to set and get
other small nits
2017-12-18 17:23:47 +00:00
Krasi Georgiev
6ff1d5c51e
add the scrape manager config reloader
...
handle errors with invalid scrape config
2017-12-18 17:23:47 +00:00
Krasi Georgiev
b0d4f6ee08
resolved merge conflict in main.go
2017-12-18 17:23:46 +00:00
Krasi Georgiev
c5cb0d2910
simplify naming and API.
2017-12-18 17:22:50 +00:00