prometheus

mirror of https://github.com/prometheus/prometheus.git synced 2024-12-26 14:09:41 -08:00

Author	SHA1	Message	Date
jessicagreben	61c9a89120	use milliseconds for blocksize Signed-off-by: jessicagreben <jessicagrebens@gmail.com>	2020-10-31 07:11:54 -07:00
jessicagreben	6980bcf671	unexport backfiller Signed-off-by: jessicagreben <jessicagrebens@gmail.com>	2020-10-31 06:40:56 -07:00
jessicagreben	3ed6457dd4	use blockwriter, rm multiwriter code Signed-off-by: jessicagreben <jessicagrebens@gmail.com>	2020-10-31 06:32:07 -07:00
Julien Pivotto	6c56a1faaa	Testify: move to require (#8122 ) * Testify: move to require Moving testify to require to fail tests early in case of errors. Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu> * More moves Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	2020-10-29 09:43:23 +00:00
Bartlomiej Plotka	3d8826a3d4	MultiError: Refactored MultiError for more concise and safe usage. (#8066 ) * MultiError: Refactored MultiError for more concise and safe usage. * Less lines * Goland IDE was marking every usage of old MultiError "potential nil" error * It was easy to forgot using Err() when error was returned, now it's safely assured on compile time. NOTE: Potentially I would rename package to merrors. (: In different PR. Signed-off-by: Bartlomiej Plotka <bwplotka@gmail.com> * Addressed review comments. Signed-off-by: Bartlomiej Plotka <bwplotka@gmail.com> * Addressed comments. Signed-off-by: Bartlomiej Plotka <bwplotka@gmail.com> * Fix after rebase. Signed-off-by: Bartlomiej Plotka <bwplotka@gmail.com>	2020-10-28 15:24:58 +00:00
Julien Pivotto	1282d1b39c	Refactor test assertions (#8110 ) * Refactor test assertions This pull request gets rid of assert.True where possible to use fine-grained assertions. Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	2020-10-27 11:06:53 +01:00
David Leadbeater	e7e60623ff	promtool: Calculate mint and maxt per test (#8096 ) * promtool: Calculate mint and maxt per test Previously a single test that used a later eval time would make all other tests in the file share the [mint, maxt] and potentially evaluate far more samples than needed. Fixes: #8019 Signed-off-by: David Leadbeater <dgl@dgl.cx>	2020-10-24 12:03:55 +01:00
Julien Pivotto	4e5b1722b3	Move away from testutil, refactor imports (#8087 ) Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	2020-10-22 11:00:08 +02:00
jessicagreben	36ac0b68f1	merge master, fix conflicts	2020-10-17 08:20:21 -07:00
Björn Rabenstein	71577e45eb	Merge pull request #8044 from prometheus/beorn7/metrics Instrumentation: Report valid configs in the respective metrics from the beginning	2020-10-12 23:32:02 +02:00
Arthur Silva Sens	4f45e201cc	Promtool tsdb list now prints block sizes (#7993 ) * promtool tsdb list now prints blocks' size Signed-off-by: arthursens <arthursens2005@gmail.com>	2020-10-12 23:15:40 +02:00
beorn7	0f3c1bf6cf	Report valid configs in the respective metrics from the beginning In #7399, an early validity check of the config was introduced to prevent the scenario where an invalid config is only detected after a possibly very long startup procedure. However, the respective success metrics are not updated after the initial validation so that the success metrics suggest an invalid config. If the startup procedure, like replaying the WAL, really takes very long, alerts about invalid config will trigger. This commit sets the succes metrics after initial validation. They will be set again after the "real" config (re-)load, but that shouldn't be a problem. The metric now truthfully represents whenever the config was successfully loaded, no matter if the result was then thrown away (because it was just for validation) or actually used. Signed-off-by: beorn7 <beorn@grafana.com>	2020-10-12 21:30:59 +02:00
David Leadbeater	5393ec22cb	promtool: Don't end alert tests early, in some failure situations If an alert test had a failing test, then any other alert test interval specified after that point would result in the test exiting early. This made debugging some tests more difficult than needed. Now only exit early for evaluation failures. Signed-off-by: David Leadbeater <dgl@dgl.cx>	2020-10-09 12:59:59 +01:00
Frederic Branczyk	da3ea43242	Merge pull request #7976 from roidelapluie/tolerance Introduce timestamp tolerance in scrapes	2020-10-08 09:21:19 +02:00
Julien Pivotto	be5ba1a62d	Fix wordings Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	2020-10-07 21:44:36 +02:00
Julien Pivotto	4617d16b4b	Specify the removal Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	2020-10-07 18:32:04 +02:00
Julien Pivotto	e2a2bf3c06	Add context Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	2020-10-07 18:30:32 +02:00
Julien Pivotto	627ff84599	Adjust flag Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	2020-10-07 18:25:52 +02:00
Julien Pivotto	6b618ecf02	Better description Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	2020-10-07 17:43:42 +02:00
Julien Pivotto	536dfb6234	Add an experimental, hidden flag Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	2020-10-07 17:31:46 +02:00
Frederic Branczyk	6be3ebdfe7	Merge pull request #8015 from simonpasquier/bump-k8s-deps Bump k8s dependencies + support k8s.io/klog/v2	2020-10-07 09:54:58 +02:00
Julien Pivotto	946819e16e	cmd/prometheus: Issue a warning on 32 bit archs (#8012 ) * cmd/prometheus: Issue a warning on 32 bit archs Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	2020-10-06 21:42:56 +02:00
Simon Pasquier	9bb3555fe4	cmd/prometheus: support k8s.io/klog/v2 Signed-off-by: Simon Pasquier <spasquie@redhat.com>	2020-10-06 14:56:14 +02:00
David Leadbeater	77c784ac93	Ensure alert rules are marked as restored in unit tests (#7661 ) This makes sure the ALERTS timeseries is created when unit testing alerting rules. Signed-off-by: David Leadbeater <dgl@dgl.cx>	2020-09-21 18:15:34 +02:00
jessicagreben	2e526cf2a7	add output dir parameter Signed-off-by: jessicagreben <jessicagrebens@gmail.com>	2020-09-13 08:38:32 -07:00
jessicagreben	dfa510086b	add alignment, mv rule importer to promtool dir, add queryRange Signed-off-by: jessicagreben <jessicagrebens@gmail.com>	2020-09-13 08:07:59 -07:00
Julien Pivotto	442b3364d7	Promtool: add evaluation time to instant query (#7829 ) * Promtool: add evaluation time to instant query Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu> * Apply suggestion Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	2020-08-25 11:32:25 +01:00
Andy Bursavich	4e6a94a27d	Invert service discovery dependencies (#7701 ) This also fixes a bug in query_log_file, which now is relative to the config file like all other paths. Signed-off-by: Andy Bursavich <abursavich@gmail.com>	2020-08-20 13:48:26 +01:00
Harold Dost	21a753c4e2	Make file permissions set to allow for wider umask options. (#7782 ) 0644 -> 0666 on all non vendored code. Fixes #7717 Signed-off-by: Harold Dost <harolddost@gmail.com>	2020-08-12 23:23:17 +02:00
Julien Pivotto	d661f84748	Log duration of reloads Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	2020-08-06 21:49:26 +02:00
Annanay	9bba8a6eae	Merge branch 'master' into appender-context Signed-off-by: Annanay <annanayagarwal@gmail.com>	2020-07-30 16:43:18 +05:30
Julien Pivotto	01e3bfcd1a	Add warnings about NFS (#7691 ) * Add warnings about NFS Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	2020-07-30 11:22:44 +02:00
Javier Palomo Almena	b58a613443	Replace sync/atomic with uber-go/atomic (#7683 ) * storage: Replace usage of sync/atomic with uber-go/atomic Signed-off-by: Javier Palomo <javier.palomo.almena@gmail.com> * tsdb: Replace usage of sync/atomic with uber-go/atomic Signed-off-by: Javier Palomo <javier.palomo.almena@gmail.com> * web: Replace usage of sync/atomic with uber-go/atomic Signed-off-by: Javier Palomo <javier.palomo.almena@gmail.com> * notifier: Replace usage of sync/atomic with uber-go/atomic Signed-off-by: Javier Palomo <javier.palomo.almena@gmail.com> * cmd: Replace usage of sync/atomic with uber-go/atomic Signed-off-by: Javier Palomo <javier.palomo.almena@gmail.com> * scripts: Verify that we are not using restricted packages It checks that we are not directly importing 'sync/atomic'. Signed-off-by: Javier Palomo <javier.palomo.almena@gmail.com> * Reorganise imports in blocks Signed-off-by: Javier Palomo <javier.palomo.almena@gmail.com> * notifier/test: Apply PR suggestions Signed-off-by: Javier Palomo <javier.palomo.almena@gmail.com> * storage/remote: avoid storing references on newEntry Signed-off-by: Javier Palomo <javier.palomo.almena@gmail.com> * Revert "scripts: Verify that we are not using restricted packages" This reverts commit `278d32748e`. Signed-off-by: Javier Palomo <javier.palomo.almena@gmail.com> * web: Group imports accordingly Signed-off-by: Javier Palomo <javier.palomo.almena@gmail.com>	2020-07-30 13:15:42 +05:30
jessicagreben	7504b5ce7c	add rule importer with tsdb block writer Signed-off-by: jessicagreben <Jessica.greben1+github@gmail.com>	2020-07-27 07:44:49 -07:00
Annanay	7f98a744e5	Add context to Appender interface Signed-off-by: Annanay <annanayagarwal@gmail.com>	2020-07-24 19:40:51 +05:30
chinhnc	e05c19da5d	Display block duration in promtool list blocks command (#7653 ) * Update tsdb.go Added DURATION column to `tsdb list` command Signed-off-by: soup <chicknsoupuds@gmail.com> * Use time.Duration instead of hardcoded hour Signed-off-by: soup <chicknsoupuds@gmail.com>	2020-07-24 19:01:20 +05:30
Ben Ye	50c261502e	add tsdb cmds into promtool (#6088 ) Signed-off-by: yeya24 <yb532204897@gmail.com> update tsdb cli in makefile and promu Signed-off-by: yeya24 <yb532204897@gmail.com> remove building tsdb bin Signed-off-by: yeya24 <yb532204897@gmail.com> remove useless func Signed-off-by: yeya24 <yb532204897@gmail.com> refactor analyzeBlock Signed-off-by: yeya24 <yb532204897@gmail.com> Fix Makefile Signed-off-by: Simon Pasquier <spasquie@redhat.com>	2020-07-23 19:35:50 +01:00
Bartlomiej Plotka	a0df8a383a	promql: Removed global and add ability to have better interval for subqueries if not specified (#7628 ) * promql: Removed global and add ability to have better interval for subqueries if not specified ## Changes * Refactored tests for better hints testing * Added various TODO in places to enhance. * Moved DefaultEvalInterval global to opts with func(rangeMillis int64) int64 function instead Motivation: At Thanos we would love to have better control over the subqueries step/interval. This is important to choose proper resolution. I think having proper step also does not harm for Prometheus and remote read users. Especially on stateless querier we do not know evaluation interval and in fact putting global can be wrong to assume for Prometheus even. I think ideally we could try to have at least 3 samples within the range, the same way Prometheus UI and Grafana assumes. Anyway this interfaces allows to decide on promQL user basis. Open question: Is taking parent interval a smart move? Motivation for removing global: I spent 1h fighting with: === RUN TestEvaluations TestEvaluations: promql_test.go:31: unexpected error: error evaluating query "absent_over_time(rate(nonexistant[5m])[5m:])" (line 687): unexpected error: runtime error: integer divide by zero --- FAIL: TestEvaluations (0.32s) FAIL At the end I found that this fails on most of the versions including this master if you run this test alone. If run together with many other tests it passes. This is due to SetDefaultEvaluationInterval(1 * time.Minute) in test that is ran before TestEvaluations. Thanks to globals (: Let's fix it by dropping this global. Signed-off-by: Bartlomiej Plotka <bwplotka@gmail.com> * Added issue links for TODOs. Signed-off-by: Bartlomiej Plotka <bwplotka@gmail.com> * Removed irrelevant changes. Signed-off-by: Bartlomiej Plotka <bwplotka@gmail.com>	2020-07-22 14:39:51 +01:00
Julien Pivotto	b83cbacbdd	Rule manager: remove blocking channel in mail (#7631 ) Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	2020-07-22 00:13:24 +02:00
Ben Ye	e6ea798c32	promtool range query should exit when fail to parse time (#7505 ) Signed-off-by: yeya24 <yb532204897@gmail.com>	2020-07-16 23:53:04 +01:00
yeya24	797e48c1a3	support time range in promtool query labels Updated prometheus/client_golang and json-iterator/go Signed-off-by: yeya24 <yb532204897@gmail.com>	2020-07-03 11:29:39 -04:00
Frederic Branczyk	d17d88935c	rules: Use narrower interface for rule manager loading of for state (#7472 ) To load ALERT_FOR_STATE only `storage.Queryable` interface is required, so this patch uses this narrower interface for to perform this. Signed-off-by: Frederic Branczyk <fbranczyk@gmail.com>	2020-06-26 19:06:36 +01:00
Bartlomiej Plotka	b788986717	storage: Adjusted fully storage layer support for chunk iterators: Remote read client, readyStorage, fanout. (#7059 ) * Fixed nits introduced by https://github.com/prometheus/prometheus/pull/7334 * Added ChunkQueryable implementation to fanout and readyStorage. * Added more comments. * Changed NewVerticalChunkSeriesMerger to CompactingChunkSeriesMerger, removed tiny interface by reusing VerticalSeriesMergeFunc for overlapping algorithm for both chunks and series, for both querying and compacting (!) + made sure duplicates are merged. * Added ErrChunkSeriesSet * Added Samples interface for seamless []promb.Sample to []tsdbutil.Sample conversion. * Deprecating non chunks serieset based StreamChunkedReadResponses, added chunk one. * Improved tests. * Split remote client into Write (old storage) and read. * Queryable client is now SampleAndChunkQueryable. Since we cannot use nice QueryableFunc I moved all config based options to sampleAndChunkQueryableClient to aboid boilerplate. In next commit: Changes for TSDB. Signed-off-by: Bartlomiej Plotka <bwplotka@gmail.com>	2020-06-24 14:41:52 +01:00
Harkishen Singh	70b0a34616	Exit early on invalid config file (#7399 ) * Reload config file at start Signed-off-by: Harkishen-Singh <harkishensingh@hotmail.com> * relocated config checking Signed-off-by: Harkishen-Singh <harkishensingh@hotmail.com> * change log lever Signed-off-by: Harkishen-Singh <harkishensingh@hotmail.com> * add helpful comment Signed-off-by: Harkishen-Singh <harkishensingh@hotmail.com>	2020-06-21 21:26:59 +05:30
Ben Kochie	8d3c2f6829	Enable WAL compression by default (#7410 ) Enable the `--storage.tsdb.wal-compression` flag by defualt. Signed-off-by: Ben Kochie <superq@gmail.com>	2020-06-18 17:59:40 +01:00
Jordan Neufeld	268b4c29e1	Support extended durations in promtool unit tests (Fixes #6285 ) (#6297 ) * Fixed evaluation_time duration parsing in promtool unit tests (Fixes #6285) Signed-off-by: Jordan Neufeld <jordan@neufeldtech.com>	2020-06-15 16:03:07 +01:00
Arthur Silva Sens	7727b9012e	Correction of misleading help text(#5142 ) (#7231 ) * Correction of misleading help text(#5142) Signed-off-by: arthursens <arthursens2005@gmail.com>	2020-05-11 12:15:01 +01:00
Julien Pivotto	9e265aba10	Merge pull request #7225 from prometheus/release-2.18 [Merge without Squash] Merge release-2.18 back to master for 2.18.1 fixes.	2020-05-07 21:23:59 +02:00
Hongcai Ren	c7e82274c6	replace github.com/prometheus/prometheus/testutil/promlint by github.com/prometheus/client_golang/prometheus/testutil/promlint from our codebase (#7209 ) Signed-off-by: RainbowMango <renhongcai@huawei.com>	2020-05-07 11:34:39 +01:00
Julien Pivotto	645b71e9ef	Fix snapshots (#7217 ) Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	2020-05-07 10:03:48 +01:00
Ganesh Vernekar	d4b9fe801f	M-map full chunks of Head from disk (#6679 ) When appending to the head and a chunk is full it is flushed to the disk and m-mapped (memory mapped) to free up memory Prom startup now happens in these stages - Iterate the m-maped chunks from disk and keep a map of series reference to its slice of mmapped chunks. - Iterate the WAL as usual. Whenever we create a new series, look for it's mmapped chunks in the map created before and add it to that series. If a head chunk is corrupted the currpted one and all chunks after that are deleted and the data after the corruption is recovered from the existing WAL which means that a corruption in m-mapped files results in NO data loss. [Mmaped chunks format](https://github.com/prometheus/prometheus/blob/master/tsdb/docs/format/head_chunks.md) - main difference is that the chunk for mmaping now also includes series reference because there is no index for mapping series to chunks. [The block chunks](https://github.com/prometheus/prometheus/blob/master/tsdb/docs/format/chunks.md) are accessed from the index which includes the offsets for the chunks in the chunks file - example - chunks of series ID have offsets 200, 500 etc in the chunk files. In case of mmaped chunks, the offsets are stored in memory and accessed from that. During WAL replay, these offsets are restored by iterating all m-mapped chunks as stated above by matching the series id present in the chunk header and offset of that chunk in that file. Prombench results _WAL Replay_ 1h Wal reply time 30% less wal reply time - 4m31 vs 3m36 2h Wal reply time 20% less wal reply time - 8m16 vs 7m _Memory During WAL Replay_ High Churn: 10-15% less RAM - 32gb vs 28gb 20% less RAM after compaction 34gb vs 27gb No Churn: 20-30% less RAM - 23gb vs 18gb 40% less RAM after compaction 32.5gb vs 20gb Screenshots are in [this comment](https://github.com/prometheus/prometheus/pull/6679#issuecomment-621678932) Signed-off-by: Ganesh Vernekar <cs15btech11018@iith.ac.in>	2020-05-06 21:00:00 +05:30
Ben Ye	1e4e37144d	Fixed wrongly handled not ready TSDB on web and API. (#7182 ) * fix federate endpoint panic Signed-off-by: yeya24 <yb532204897@gmail.com> * Fixed all cases of not ready TSDB being wrongly handled. * Fixed issue for federation. * Ensured this will never happen again thanks to interfaces * Fixes same issue for stats. * Added tests for readiness. * Fixed bug in stats. It was: status.MaxTime = db.Head().MaxTime() status.MinTime = db.Head().MaxTime() Signed-off-by: Bartlomiej Plotka <bwplotka@gmail.com> * Addressed Brian's comments. Signed-off-by: Bartlomiej Plotka <bwplotka@gmail.com> * Addressed Brian's comments. Signed-off-by: Bartlomiej Plotka <bwplotka@gmail.com> Co-authored-by: Bartlomiej Plotka <bwplotka@gmail.com>	2020-04-29 17:16:14 +01:00
Vasily Sliouniaev	0393b188c9	Add Jaeger (#7148 ) * Trace remote read Signed-off-by: vas <vasily.sliouniaev@jet.com> * Use jaeger Signed-off-by: vas <vasily.sliouniaev@jet.com>	2020-04-23 02:05:55 +02:00
Marek Slabicki	8224ddec23	Capitalizing first letter of all log lines (#7043 ) Signed-off-by: Marek Slabicki <thaniri@gmail.com>	2020-04-11 09:22:18 +01:00
Brian Brazil	7646cbca32	Use .UTC everywhere we use time.Unix (#7066 ) time.Unix attaches the local timezone, which can then leak out (e.g. in the alert json). While this is harmless, we should be consistent. Signed-off-by: Brian Brazil <brian.brazil@robustperception.io>	2020-03-29 17:35:39 +01:00
Ben Kochie	269e7c8091	Fix golint issues. Signed-off-by: Ben Kochie <superq@gmail.com>	2020-03-23 20:38:43 +01:00
johncming	bbacd2dd09	remove needless break. (#7008 ) Signed-off-by: johncming <johncming@yahoo.com>	2020-03-19 11:21:00 +00:00
李国忠	52025bd7a9	[comments] change word ‘wheter’ to ‘whether’ (#6912 ) * [comments] change word ‘wheter’ to ‘whether’ Signed-off-by: fuling <fuling.lgz@alibaba-inc.com> * [comments] change word ‘wheter’ to ‘whether’ Signed-off-by: fuling <fuling.lgz@alibaba-inc.com>	2020-03-02 13:51:24 +05:30
Tobias Guggenmos	4835bbf376	Merge branch 'master' into split_parser	2020-02-19 15:18:13 +01:00
Bartlomiej Plotka	48ead578a0	Moved tsdbconfig to main. Signed-off-by: Bartlomiej Plotka <bwplotka@gmail.com>	2020-02-18 11:25:36 +00:00
Bartlomiej Plotka	a20bebf7eb	Moved readyStorage to main. Signed-off-by: Bartlomiej Plotka <bwplotka@gmail.com>	2020-02-17 18:03:57 +00:00
Bartlomiej Plotka	8a775bc468	Moved unit agnostic options to separate pkg. Signed-off-by: Bartlomiej Plotka <bwplotka@gmail.com>	2020-02-17 18:03:57 +00:00
Bartlomiej Plotka	59c9d6ef45	Addressed Brian's comments, moved metrics to main.go Signed-off-by: Bartlomiej Plotka <bwplotka@gmail.com>	2020-02-17 18:03:57 +00:00
Bartlomiej Plotka	cfba92a133	Addressed comments. Signed-off-by: Bartlomiej Plotka <bwplotka@gmail.com>	2020-02-17 18:03:57 +00:00
Bartlomiej Plotka	34426766d8	Unify Iterator interfaces. All point to storage now. This is part of https://github.com/prometheus/prometheus/pull/5882 that can be done to simplify things. All todos I added will be fixed in follow up PRs. * querier.Querier, querier.Appender, querier.SeriesSet, and querier.Series interfaces merged with storage interface.go. All imports that. * querier.SeriesIterator replaced by chunkenc.Iterator * Added chunkenc.Iterator.Seek method and tests for xor implementation (?) * Since we properly handle SelectParams for Select methods I adjusted min max based on that. This should help in terms of performance for queries with functions like offset. * added Seek to deletedIterator and test. * storage/tsdb was removed as it was only a unnecessary glue with incompatible structs. No logic was changed, only different source of abstractions, so no need for benchmarks. Signed-off-by: Bartlomiej Plotka <bwplotka@gmail.com>	2020-02-17 18:03:54 +00:00
Tobias Guggenmos	454ba12676	Fix build errors in promtool Signed-off-by: Tobias Guggenmos <tguggenm@redhat.com>	2020-02-17 16:09:23 +01:00
Björn Rabenstein	af04cb22c8	Merge pull request #6821 from prometheus/release-2.16 Release 2.16	2020-02-14 13:10:14 +01:00
Julien Pivotto	ff0003e072	Make lookbackDelta a option of QueryEngine (#6746 ) * Make lookbackDelta a option of QueryEngine Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu> * julius' suggestion Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu> * remove trivial getter Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu> * Assume lookback delta is always > 0 Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu> * add debug log Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu> * don't expose loopback delta Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu> * Specify that lookack delta is also used in federation Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu> * Fix federation test While we have added some logic to the promql engine to keep it backwards compatible and have a 5 minute loopback by default, the web/ package is likely to really be internal to Prometheus and we should not add the same kind of heuritstics here. Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu> * loopback delta: Fix debug log Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	2020-02-10 00:58:23 +01:00
Julien Pivotto	d799078c88	also test start and end Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	2020-02-08 16:42:50 +01:00
Julien Pivotto	881dde505a	promql: fix promql query log step unit Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	2020-02-08 16:26:56 +01:00
Julien Pivotto	3c4c01eae2	Fix race in Query Log Test (#6727 ) A data race can happen if we run t.Log after the test t is done -- which in this case is highly possible because of the use of subtests and the fact that we call t.Log in a goroutine. Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	2020-01-30 13:51:18 -08:00
Julien Pivotto	9adad8ad30	Remove MaxConcurrent from the PromQL engine opts (#6712 ) Since we use ActiveQueryTracker to check for concurrency in `d992c36b3a` it does not make sense to keep the MaxConcurrent value as an option of the PromQL engine. This pull request removes it from the PromQL engine options, sets the max concurrent metric to -1 if there is no active query tracker, and use the value of the active query tracker otherwise. It removes dead code and also will inform people who import the promql package that we made that change, as it breaks the EngineOpts struct. Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	2020-01-28 20:38:49 +00:00
Julien Pivotto	5f27ac3583	Refactor query log fields (#6694 ) * Refactor query log fields Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	2020-01-27 09:53:10 +00:00
Julien Pivotto	2b2eb79e8b	Add windows tests for query logger (#6653 ) * Add windows tests * Do not rely on time.Time in timer Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	2020-01-20 13:17:11 +00:00
Julien Pivotto	0eb34299da	End-to-end Query Log test (#6600 ) * End-to-end Query Log test Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	2020-01-19 21:56:13 +00:00
Julien Pivotto	1a58d2657d	Removed compilation step inside main_test (#6658 ) Inspired by https://github.com/prometheus/prometheus/pull/6347 and https://github.com/prometheus/prometheus/pull/6347#issuecomment-570151979 Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	2020-01-19 07:14:25 +00:00
Harkishen Singh	84e6459c4d	Adds support for line-column numbers for invalid rules, promtool (#6533 ) Signed-off-by: Harkishen Singh <harkishensingh@hotmail.com>	2020-01-15 18:07:54 +00:00
Julien Pivotto	3885562587	Query Logging styling (#6594 ) - Fix Json vs JSON in activequerylogger - Fix SetQueryLogger always returns nil Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	2020-01-09 21:11:39 +00:00
Julien Pivotto	9d9bc524e5	Add query log (#6520 ) * Add query log, make stats logged in JSON like in the API Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	2020-01-08 13:28:43 +00:00
Simon Pasquier	cccd542891	*: avoid missed Alertmanager targets (#6455 ) This change makes sure that nearly-identical Alertmanager configurations aren't merged together. The config's identifier was the MD5 hash of the configuration serialized to JSON but because `relabel.Regexp` has no public field and doesn't implement the JSON.Marshaler interface, it was always serialized to "{}". In practice, the identifier can be based on the index of the configuration in the list. Signed-off-by: Simon Pasquier <spasquie@redhat.com>	2019-12-12 17:00:19 +01:00
Brooks Swinnerton	0ea3a2218d	Add time units to storage.tsdb.retention.size flag (#6365 ) * Add time units to storage.tsdb.retention.size flag In an effort to reduce confusion with the `m` option of the `ParseDuration()` function, this commit adds the available time units to the `storage.tsdb.retention.time` flag to help showcase that there is no option for months (which could be assumed to be `m`). If someone were looking to set the retention to six months, they may mistakenly do so with `6m`, which would reduce their retention to six minutes. Signed-off-by: Brooks Swinnerton <bswinnerton@gmail.com>	2019-11-30 08:00:51 +00:00
johncming	ad4bc5701e	remove unwanted break (#6338 ) Signed-off-by: johncming <johncming@yahoo.com>	2019-11-18 23:01:03 -08:00
akerele abraham	9d39fdad0c	unittest: check for rule files existence (#6075 ) Signed-off-by: akerele abraham <abrahamakerele38@gmail.com>	2019-11-18 13:54:52 -08:00
Chris Marchbanks	1d1f64b4bc	Fix Promtool showing false duplicate rule warnings (#6270 ) Alert rules do not use the Record field, so any alerts with the same labels and different names would be counted as being duplicates. Promtool will now consider either field when finding duplicates. Signed-off-by: Chris Marchbanks <csmarchbanks@gmail.com>	2019-11-05 11:22:31 -07:00
Simon Pasquier	ddff1480a7	cmd/promtool: improve output for PromQL tests (#6052 ) Signed-off-by: Simon Pasquier <spasquie@redhat.com>	2019-09-25 09:26:29 +02:00
Harkishen Singh	e097c70e6d	add checks for metrics and display duplicate fields (#6026 ) Signed-off-by: Harkishen-Singh <harkishensingh@hotmail.com>	2019-09-20 11:29:47 +01:00
Simon Pasquier	06066a3619	*: improve error messages when parsing bad rules (#5965 ) Signed-off-by: Simon Pasquier <spasquie@redhat.com>	2019-08-28 17:36:48 +02:00
Sayan Chowdhury	cb66e325d8	Show the warnings during label query (#5924 ) This patch loops through the warnings while querying the label and spits the output to stderr Fixes #5885 Signed-off-by: Sayan Chowdhury <sayan.chowdhury2012@gmail.com>	2019-08-24 19:42:21 +02:00
Bartek Płotka	48b2c9c8ea	remote-read: streamed chunked server side; Extended protobuf; Added chunked, checksumed reader (#5703 ) Part of: https://github.com/prometheus/prometheus/issues/4517 and https://github.com/improbable-eng/thanos/issues/488 Changes: * Extended protobuf for chunked remote read and negotation. * Added checksumed, chunked Writer/Reader. * Added Server side implementation for chunked streamed remote-read. Signed-off-by: Bartek Plotka <bwplotka@gmail.com>	2019-08-19 21:16:10 +01:00
Bartek Płotka	5cb32d67f9	Merge pull request #5893 from prometheus/unify-tsdbutil Removed extra tsdb/testutil after merge.	2019-08-15 12:07:59 +01:00
Bartek Plotka	f0863a604e	Removed extra tsdb/testutil after merge. Signed-off-by: Bartek Plotka <bwplotka@gmail.com>	2019-08-14 10:12:32 +01:00
Julius Volz	b5c833ca21	Update go.mod dependencies before release (#5883 ) * Update go.mod dependencies before release Signed-off-by: Julius Volz <julius.volz@gmail.com> * Add issue for showing query warnings in promtool Signed-off-by: Julius Volz <julius.volz@gmail.com> * Revert json-iterator back to 1.1.6 It produced errors when marshaling Point values with special float values. Signed-off-by: Julius Volz <julius.volz@gmail.com> * Fix expected step values in promtool tests after client_golang update Signed-off-by: Julius Volz <julius.volz@gmail.com> * Update generated protobuf code after proto dep updates Signed-off-by: Julius Volz <julius.volz@gmail.com>	2019-08-14 11:00:39 +02:00
Advait Bhatwadekar	5d401f1e1b	Added query logging for prometheus. Issue #1315 (#5794 ) * Added query logging for prometheus. Options added: 1) active.queries.filepath: Filename where queries will be recorded 2) active.queries.filesize: Size of the file where queries will be recorded. Functionality added: All active queries are now logged in a file. If prometheus crashes unexpectedly, these queries are also printed out on stdout in the rerun. Queries are written concurrently to an mmaped file, and removed once they are done. Their positions in the file are reused. They are written in json format. However, due to dynamic nature of application, the json has an extra comma after the last query, and is missing an ending ']'. There may also null bytes in the tail of file. Signed-off-by: Advait Bhatwadekar <advait123@ymail.com>	2019-07-31 16:12:43 +01:00
Simon Pasquier	75886e0464	cmd/promtool: fix panic with empty exp_labels Signed-off-by: Simon Pasquier <spasquie@redhat.com>	2019-07-17 17:02:31 +02:00
Chris Marchbanks	06f1ba73eb	Provide flag to compress the tsdb WAL Signed-off-by: Chris Marchbanks <csmarchbanks@gmail.com>	2019-07-03 08:03:29 -06:00
Tom Wilkie	851131b074	Allow injection of arbitrary headers in promtool, for auth etc. (#4389 ) Signed-off-by: Tom Wilkie <tom.wilkie@gmail.com>	2019-06-30 11:50:23 +01:00
Simon Pasquier	be67b8d460	web: fix flaky TestHTTPMetrics() (#5695 ) Signed-off-by: Simon Pasquier <spasquie@redhat.com>	2019-06-24 15:48:15 +02:00
Björn Rabenstein	dc22f74153	Merge pull request #5608 from simonpasquier/external-labels-for-alert-tests cmd/promtool: add $externalLabels for alert unit tests	2019-06-20 16:48:12 +02:00
Björn Rabenstein	372b3438e5	Update prometheus/client_golang to v1.0.0 (#5682 ) Signed-off-by: beorn7 <beorn@grafana.com>	2019-06-17 19:14:36 +01:00
Keenan Romain	55f3a9fe4a	Allows globs for rules when unit testing (#5595 ) * Includes glob support when unit testing rule_files. Signed-off-by: Keenan Romain <Keenan.Romain@mailchimp.com>	2019-06-12 11:31:07 +01:00
Simon Pasquier	74ff35ccdd	cmd/promtool: add $externalLabels for alert unit tests Signed-off-by: Simon Pasquier <spasquie@redhat.com>	2019-05-29 16:40:01 +02:00
beorn7	aff4738f33	Adjust TestQueryRange to new Prometheus API client Signed-off-by: beorn7 <bjoern@rabenste.in>	2019-05-17 18:09:47 +02:00
Lee Gaines	f4486815c1	logs filesystem type on startup (#5558 ) Signed-off-by: Lee Gaines <leetgaines@gmail.com>	2019-05-17 10:16:16 +01:00
Björn Rabenstein	0a34399611	Fix minor punctuation and language issues in flag doc strings (#5568 ) This is mostly to create consistency, not because the one or the other way would be wrong. A few actual corrections are also included. Signed-off-by: beorn7 <bjoern@rabenste.in>	2019-05-15 16:59:06 +02:00
Simon Pasquier	45506841e6	*: enable all default linters (#5504 ) Signed-off-by: Simon Pasquier <spasquie@redhat.com>	2019-05-03 15:11:28 +02:00
Simon Pasquier	9c69eec82a	cmd/promtool: use log.NewNopLogger() (#5531 ) Signed-off-by: Simon Pasquier <spasquie@redhat.com>	2019-05-03 10:00:07 +01:00
Frederic Branczyk	c790d7658c	Merge pull request #5491 from metalmatze/rungroup Use github.com/oklog/run not archived oklog/oklog	2019-04-29 16:22:16 +02:00
Björn Rabenstein	0be9388f8d	Merge pull request #5463 from prometheus/beorn7/templating Follow-up on #5009	2019-04-24 16:42:23 +02:00
Simon Pasquier	abc1994bec	cmd/promtool: return errors from rule evaluations (#5483 ) Signed-off-by: Simon Pasquier <spasquie@redhat.com>	2019-04-23 09:59:03 +02:00
Matthias Loibl	388caa06ac	Use github.com/oklog/run not archived oklog/oklog Signed-off-by: Matthias Loibl <mail@matthiasloibl.com>	2019-04-19 14:55:28 +02:00
Bjoern Rabenstein	38d518c0fe	Rework #5009 after comments Signed-off-by: Bjoern Rabenstein <bjoern@rabenste.in>	2019-04-17 01:40:10 +02:00
Bjoern Rabenstein	a92ef68dd8	Fix staticcheck errors Not sure why they only show up now. Signed-off-by: Bjoern Rabenstein <bjoern@rabenste.in>	2019-04-17 01:40:10 +02:00
Sylvain Rabot	335a34486e	Add external labels to template expansion This affects the expansion of templates in alert labels and annotations and console templates. Signed-off-by: Sylvain Rabot <sylvain@abstraction.fr>	2019-04-17 01:40:10 +02:00
Simon Pasquier	e5dbac7972	cmd/prometheus: group flags properly (#5419 ) Signed-off-by: Simon Pasquier <spasquie@redhat.com>	2019-04-10 13:22:05 +01:00
David Symonds	7a60e22c2d	cmd/promtool: resolve relative paths in alert test files (#5336 ) Like `promtool check config <path/to/foo.yaml>`, which resolves relative paths inside foo.yaml to be relative to `path/to`, this now makes `promtool test rules <path/to/test.yaml>` do the same thing. Signed-off-by: David Symonds <dsymonds@gmail.com>	2019-03-27 10:27:26 +01:00
Tariq Ibrahim	8fdfa8abea	refine error handling in prometheus (#5388 ) i) Uses the more idiomatic Wrap and Wrapf methods for creating nested errors. ii) Fixes some incorrect usages of fmt.Errorf where the error messages don't have any formatting directives. iii) Does away with the use of fmt package for errors in favour of pkg/errors Signed-off-by: tariqibrahim <tariq181290@gmail.com>	2019-03-26 00:01:12 +01:00
Brian Brazil	0a87dcd416	cmd: Warn rather than Info when retention time wraps (#5403 ) Signed-off-by: Brian Brazil <brian.brazil@robustperception.io>	2019-03-25 18:06:38 +00:00
Krasi Georgiev	9d96ada510	Display correct values for the retention in the flags web gui. (#5322 ) * Display correct values for the retention in the flags web gui. Signed-off-by: Krasi Georgiev <kgeorgie@redhat.com> * adding a log entry Signed-off-by: Krasi Georgiev <kgeorgie@redhat.com> * added the retention info to the runtime status page Signed-off-by: Krasi Georgiev <kgeorgie@redhat.com> * simplify the retention display Signed-off-by: Krasi Georgiev <kgeorgie@redhat.com>	2019-03-11 22:48:57 +05:30
Krasi Georgiev	1684dc750a	updated tsdb to 0.6.0 (#5292 ) * updated tsdb to 0.6.0 as part of the update also added the new storage.tsdb.allow-overlapping-blocks flag and mark it as experimental.	2019-03-04 21:42:45 +02:00
Simon Pasquier	c8a1a5a93c	discovery/kubernetes: fix support for password_file and bearer_token_file (#5211 ) * discovery/kubernetes: fix support for password_file Signed-off-by: Simon Pasquier <spasquie@redhat.com> * Create and pass custom RoundTripper to Kubernetes client Signed-off-by: Simon Pasquier <spasquie@redhat.com> * Use inline HTTPClientConfig Signed-off-by: Simon Pasquier <spasquie@redhat.com>	2019-02-20 11:22:34 +01:00
Krasi Georgiev	a3c41f4256	use the default time retention value only when no size retention is set (#5216 ) fixes https://github.com/prometheus/prometheus/issues/5213 Now that we have time and size base retention time bases should not have a default value. A default is set only when both - time and size flags are not set. This change will not affect current installations that rely on the default time based value, and will avoid confusions when only the size retention is set and it is expected that the default time based setting would be no longer in place. Signed-off-by: Krasi Georgiev <kgeorgie@redhat.com>	2019-02-19 13:53:43 +02:00
Callum Styan	6f69e31398	Tail the TSDB WAL for remote_write This change switches the remote_write API to use the TSDB WAL. This should reduce memory usage and prevent sample loss when the remote end point is down. We use the new LiveReader from TSDB to tail WAL segments. Logic for finding the tracking segment is included in this PR. The WAL is tailed once for each remote_write endpoint specified. Reading from the segment is based on a ticker rather than relying on fsnotify write events, which were found to be complicated and unreliable in early prototypes. Enqueuing a sample for sending via remote_write can now block, to provide back pressure. Queues are still required to acheive parallelism and batching. We have updated the queue config based on new defaults for queue capacity and pending samples values - much smaller values are now possible. The remote_write resharding code has been updated to prevent deadlocks, and extra tests have been added for these cases. As part of this change, we attempt to guarantee that samples are not lost; however this initial version doesn't guarantee this across Prometheus restarts or non-retryable errors from the remote end (eg 400s). This changes also includes the following optimisations: - only marshal the proto request once, not once per retry - maintain a single copy of the labels for given series to reduce GC pressure Other minor tweaks: - only reshard if we've also successfully sent recently - add pending samples, latest sent timestamp, WAL events processed metrics Co-authored-by: Chris Marchbanks <csmarchbanks.com> (initial prototype) Co-authored-by: Tom Wilkie <tom.wilkie@gmail.com> (sharding changes) Signed-off-by: Callum Styan <callumstyan@gmail.com>	2019-02-12 11:39:13 +00:00
Brian Brazil	1dd57765b4	Reduce time that alertmanagers are in flux when reloaded. (#5126 ) This no longer waits for all of the scrape reload to complete before getting a list of AMs again. Signed-off-by: Brian Brazil <brian.brazil@robustperception.io>	2019-01-28 18:34:12 +00:00
Goutham Veeramachaneni	4068968e12	Protect retention from overflowing (#5112 ) Also sanitise the max block duration to max a month. Signed-off-by: Goutham Veeramachaneni <gouthamve@gmail.com>	2019-01-18 20:18:06 +05:30
Goutham Veeramachaneni	384cba1211	Add flag for size based retention (#5109 ) * Add flag for size based retention Signed-off-by: Goutham Veeramachaneni <gouthamve@gmail.com> * Deprecate the old retention flag for a new one. Signed-off-by: Goutham Veeramachaneni <gouthamve@gmail.com> * Add ability to take a suffix for size flag Signed-off-by: Goutham Veeramachaneni <gouthamve@gmail.com> * Address feedback Signed-off-by: Goutham Veeramachaneni <gouthamve@gmail.com>	2019-01-18 19:18:36 +05:30
Hrishikesh Barman	a1f34bec2e	Added CORS Origin flag (#5011 ) Signed-off-by: Hrishikesh Barman <hrishikeshbman@gmail.com>	2019-01-17 15:01:06 +00:00
Matt Layher	302148fd69	*: apply gofmt -s Signed-off-by: Matt Layher <mdlayher@gmail.com>	2019-01-16 17:28:14 -05:00
Ryan Leung	45c8b084c6	fix TestFailedStartupExitCode (#5076 ) Signed-off-by: rleungx <rleungx@gmail.com>	2019-01-16 10:13:36 +01:00
Lv Jiawei	b8ede99767	Fix comment typo (#5087 ) According to code, I think it is a typo. Signed-off-by: MIBc <lvjiawei@cmss.chinamobile.com>	2019-01-09 10:56:47 +00:00
Frederic Branczyk	e9ae0b5a1b	Merge pull request #4927 from tariq1890/update_k8s update client-go to v10.0.0 and other k8s deps to v1.13.1	2019-01-07 10:54:34 +01:00
Simon Pasquier	f678e27eb6	: use latest release of staticcheck (#5057 ) : use latest release of staticcheck It also fixes a couple of things in the code flagged by the additional checks. Signed-off-by: Simon Pasquier <spasquie@redhat.com> Use official release of staticcheck Also run 'go list' before staticcheck to avoid failures when downloading packages. Signed-off-by: Simon Pasquier <spasquie@redhat.com>	2019-01-04 14:47:38 +01:00
tariqibrahim	9b4a25e7b0	use klog dependency Signed-off-by: tariqibrahim <tariq181290@gmail.com>	2019-01-03 13:57:20 -08:00
glutamatt	5ddde1965b	tune the "Wal segment size" with a flag (#5029 ) Add WALSegmentSize as an option, and the corresponding flag "storage.tsdb.wal-segment-size" to tune the max size of wal segment files. The addressed base problem is to reduce the disk space used by wal segment files : on a raspberry pi, for instance, we often want to reduce write load of the sd card, then, the wal directory is mounted on a memory (space limited) partition. the default value of the segment max file size, pushed the size of directory to 128 MB for each segment , which is too much ram consumption on a rasp. the initial discussion is at https://github.com/prometheus/tsdb/pull/450	2019-01-03 17:13:21 +03:00
Ganesh Vernekar	7d30ccd0eb	Sort samples before comparing - PromQL unit test (#5052 ) Signed-off-by: Ganesh Vernekar <cs15btech11018@iith.ac.in>	2018-12-31 10:55:49 +00:00
Ganesh Vernekar	dbe55c1352	Subquery (#4831 ) Signed-off-by: Ganesh Vernekar <cs15btech11018@iith.ac.in>	2018-12-22 13:47:13 +00:00
Simon Pasquier	a2766a94a3	cmd/prometheus: add tests for sendAlerts() (#4910 ) Signed-off-by: Simon Pasquier <spasquie@redhat.com>	2018-12-18 11:15:46 +00:00
AixesHunter	1b166d7174	Fix variable 'notifier' collides with imported package name 'github.com/prometheus/prometheus/notifier', changed to 'notifierManager'. (#4947 ) Signed-off-by: aixeshunter <aixeshunter@gmail.com>	2018-12-18 11:13:18 +00:00
Ganesh Vernekar	fbadd88ba5	Get unique eval times for alert unit tests (#4964 ) Signed-off-by: Ganesh Vernekar <cs15btech11018@iith.ac.in>	2018-12-18 08:40:03 +00:00
Simon Pasquier	ac9d5f3d53	cmd/prometheus: replace glog by glog-gokit (#4931 ) Signed-off-by: Simon Pasquier <spasquie@redhat.com>	2018-12-04 15:01:12 +01:00
Krasi Georgiev	080e6ed31a	collect cpu and trace profiles with the promtool debug command (#4897 ) Signed-off-by: Krasi Georgiev <kgeorgie@redhat.com>	2018-11-23 17:57:31 +02:00
Alex Yu	5dcce32ef8	update promlog to latest version (#4876 ) * update promlog to latest version Signed-off-by: Alex Yu <yu.alex96@gmail.com> * Update api tests, fix main setup Signed-off-by: Alex Yu <yu.alex96@gmail.com> * tidy go.sum Signed-off-by: Alex Yu <yu.alex96@gmail.com> * revendor prometheus/common Signed-off-by: Alex Yu <yu.alex96@gmail.com> * only initialize config; use kingpin for remote_storage_adapter Signed-off-by: Alex Yu <yu.alex96@gmail.com> * actually parse the flags Signed-off-by: Alex Yu <yu.alex96@gmail.com> * clean up imports Signed-off-by: Alex Yu <yu.alex96@gmail.com>	2018-11-23 14:22:40 +01:00
Ganesh Vernekar	cfb3769274	Lazily load samples for unit testing (#4851 ) * Lazily load samples for unit testing Signed-off-by: Ganesh Vernekar <cs15btech11018@iith.ac.in> * cleanup Signed-off-by: Ganesh Vernekar <cs15btech11018@iith.ac.in>	2018-11-22 14:21:38 +05:30
achiuBAE	a9050c45f6	Allow setting the Prometheus instance document title through a flag. (#4841 ) * web: added ability to set page title through flag. Signed-off-by: Andrew Chiu <andrew.chiu2@baesystems.com> * Reformatted variable names and Flag description for readability. Signed-off-by: Andrew Chiu <andrew.chiu2@baesystems.com> * assets_vfsdata.go Signed-off-by: Andrew Chiu <andrew.chiu2@baesystems.com> * Flag name changed from web.ui-title to web.page-title Signed-off-by: Andrew Chiu <andrew.chiu2@baesystems.com> * make assets Signed-off-by: Andrew Chiu <andrew.chiu2@baesystems.com>	2018-11-21 12:45:06 +08:00
stuart nelson	6a69471bc2	[promtool] Support writing output as json (#4848 ) * Support writing output as json Oftentimes I'll want to execute something based on the output from promtool, and supporting json makes it easy to pull out values with a supporting tool such as jq. Signed-off-by: stuart nelson <stuartnelson3@gmail.com>	2018-11-14 18:40:07 +01:00
Lucas Serven	70c8b2c63c	cmd/prometheus: buffer signal chans According to the GoDoc for os.Signal [0]: > Package signal will not block sending to c: the caller must ensure that > c has sufficient buffer space to keep up with the expected signal rate. > For a channel used for notification of just one signal value, a buffer > of size 1 is sufficient. [0] https://golang.org/pkg/os/signal/#Notify Signed-off-by: Lucas Serven <lserven@gmail.com>	2018-11-14 10:33:28 +01:00
Frederic Branczyk	bda9781ccd	Merge pull request #3839 from brancz/remove-old-alert-record promql: Remove old and unused alerting/reconding syntax	2018-11-06 15:53:27 +01:00
Simon Pasquier	a30348f1a4	discovery: add config label to discovered targets metric (#4753 ) * discovery: add labels to discovered targets metric Signed-off-by: Simon Pasquier <spasquie@redhat.com>	2018-10-18 16:46:59 +01:00
Callum Styan	9bca041285	WIP: keep track of samples per query, set a max # of samples (#4513 ) * keep track of samples per query, set a max # of samples that can be in memory at once Signed-off-by: Callum Styan <callumstyan@gmail.com>	2018-10-02 12:59:19 +01:00
Tom Wilkie	4c52400708	Limit concurrent remote reads. (#4656 ) Signed-off-by: Tom Wilkie <tom.wilkie@gmail.com>	2018-09-25 20:07:34 +01:00
Ganesh Vernekar	5790d23fd8	Unit testing for rules (#4350 ) * Unit testing for rules * Specifying order of group evaluation in unit tests Signed-off-by: Ganesh Vernekar <cs15btech11018@iith.ac.in>	2018-09-25 17:06:26 +01:00
Tom Wilkie	457e4bb58e	Limit the number of samples remote read can return. (#4532 ) * Limit the number of samples remote read can return. - Return 413 entity too large. - Limit can be set be a flag. Allow 0 to mean no limit. - Include limit in error message. - Set default limit to 50M (* 16 bytes = 800MB). Signed-off-by: Tom Wilkie <tom.wilkie@gmail.com>	2018-09-05 15:50:50 +02:00
Chris Marchbanks	63ed9d1b70	Send EndsAt along with alerts (#4550 ) Signed-off-by: Chris Marchbanks <csmarchbanks@gmail.com>	2018-08-28 16:05:00 +01:00
Chris Marchbanks	87f1dad16d	throttle resends of alerts to 1 minute by default (#4538 ) Signed-off-by: Chris Marchbanks <csmarchbanks@gmail.com>	2018-08-27 17:41:42 +01:00
Krasi Georgiev	12fe204ea6	move runtime debug funcs in own package (#4494 ) To make local debuging with `go run` easyer moved all files into a dedicate package `runtime`. This allows running prometheus just by using `go run main.go` instead of passing mani files like `go run main.go limits_default.go ...` Signed-off-by: Krasi Georgiev <kgeorgie@redhat.com>	2018-08-22 13:41:11 +03:00
Simon Pasquier	08c2f50382	Merge pull request #4418 from simonpasquier/log-vm-limits prometheus: log virtual memory limits	2018-08-07 16:27:46 +02:00
Frederic Branczyk	b0b3e3dd74	promql: Remove old and unused alerting/reconding syntax Signed-off-by: Frederic Branczyk <fbranczyk@gmail.com>	2018-08-07 15:14:06 +02:00
Dave Henderson	73a08f0045	promtool - Adding --step flag to 'query range' subcommand (#4454 ) Signed-off-by: Dave Henderson <dhenderson@gmail.com>	2018-08-05 11:03:18 +02:00
Julius Volz	90521a65f8	Remove error return value from NotifyFunc() (#4459 ) It's always nil and we also forgot to check it. Signed-off-by: Julius Volz <julius.volz@gmail.com>	2018-08-04 21:31:12 +02:00
Ganesh Vernekar	f1db699dff	Persist alert 'for' state across restarts (#4061 ) Signed-off-by: Ganesh Vernekar <cs15btech11018@iith.ac.in>	2018-08-02 11:18:24 +01:00
Simon Pasquier	a94450c288	Fix build for openbsd Signed-off-by: Simon Pasquier <spasquie@redhat.com>	2018-07-31 14:41:30 +02:00
Simon Pasquier	141c188ae6	Enforce conversion for freebsd Signed-off-by: Simon Pasquier <spasquie@redhat.com>	2018-07-26 14:58:56 +02:00
Simon Pasquier	208d21a393	Add comment and print units Signed-off-by: Simon Pasquier <spasquie@redhat.com>	2018-07-26 10:26:58 +02:00
Simon Pasquier	ba22b10113	prometheus: log virtual memory limits Signed-off-by: Simon Pasquier <spasquie@redhat.com>	2018-07-25 15:51:27 +02:00
Daisy T	a3376e8f36	add query labels command to promtool (#4346 ) Signed-off-by: Daisy T <daisyts@gmx.com>	2018-07-18 16:27:28 +02:00
Julius Volz	95dfb1b1dd	Add missing import to promtool, fix build (#4395 ) Sorry, I used GitHub's web-based merge-conflict-resolution editor on https://github.com/prometheus/prometheus/pull/4308 and it didn't show me test errors afterwards, but maybe they didn't run again or I should have waited or something. Signed-off-by: Julius Volz <julius.volz@gmail.com>	2018-07-18 10:26:45 +02:00
Shubheksha	125da3b812	promtool: add command for querying series (#4308 ) Signed-off-by: Shubheksha Jalan <jshubheksha@gmail.com>	2018-07-18 10:15:58 +02:00
Julius Volz	03aa3a3de8	main: Improve / clean up error messages (#4286 ) Signed-off-by: Julius Volz <julius.volz@gmail.com>	2018-07-18 09:58:40 +02:00
Chih-Hung Yeh	912d19fb85	Add 3 commands in `promtool` for getting debug information from prometheus server (#4247 ) `debug all` - all information `debug metrics` - metrics information `debug pprof` - profiling information the final result is compressed in a `tar.gz` file Signed-off-by: chyeh <chyeh.taiwan@gmail.com>	2018-07-18 10:52:01 +03:00
Brian Brazil	68e8b80ffe	Reorder startup and shutdown to prevent panics. (#4321 ) Start rule manager only after tsdb and config is loaded. Stop rule manager before tsdb to avoid writing to closed storage. Wait for any in-progress reloads to complete before shutting down rule manager, so that rule manager doesn't get updated after being shut down. Remove incorrect comment around shutting down query enginge. Log when config reload is completed. Fixes #4133 Fixes #4262 Signed-off-by: Brian Brazil <brian.brazil@robustperception.io>	2018-07-04 13:41:16 +01:00
Michael Khalil	78e0784d04	return error exit status in prometheus cli (#4296 ) Signed-off-by: mikeykhalil <mikeyfkhalil@gmail.com>	2018-06-21 08:32:26 +01:00
Tom Wilkie	8acad5f3cd	make it compile Signed-off-by: Tom Wilkie <tom.wilkie@gmail.com>	2018-05-24 15:40:24 +01:00
Tom Wilkie	e51d6c4b6c	Make remote flush deadline a command line param. Signed-off-by: Tom Wilkie <tom.wilkie@gmail.com>	2018-05-23 15:06:01 +01:00
Sneha Inguva	c1a851074b	promtool: add query instant and query range commands (#4085 ) * promtool: add QueryInstant and QueryRange cmds * promtool: add more query functions * promtool: finished query Instant * promtool: add range query * promtool: add query command and address arguments * vendor client and api	2018-04-26 20:41:56 +02:00
Mario Trangoni	464e747f1e	fix some comments typos (#4059 )	2018-04-08 10:51:54 +01:00
Sneha Inguva	7be846754a	main: actor functionality comments	2018-04-01 11:19:30 -07:00
Marek Siarkowicz	bb86c3f62b	Report internal runtime information on status page (#3921 ) Add information about tsdb, wal and config reload	2018-03-21 16:08:37 +00:00
James Turnbull	ba5273a0ab	Minor edits to help text (#3990 )	2018-03-20 16:54:36 +00:00
Simon Pasquier	e1fd96db25	cmd: fix help text (#3989 )	2018-03-20 15:58:19 +00:00
ferhat elmas	ffa673f7d8	General simplifications (#3887 ) Another try as in #1516	2018-02-26 07:58:10 +00:00
Bartek Plotka	93a63ac5fd	api: Added v1/status/flags endpoint. (#3864 ) Endpoint URL: /api/v1/status/flags Example Output: ```json { "status": "success", "data": { "alertmanager.notification-queue-capacity": "10000", "alertmanager.timeout": "10s", "completion-bash": "false", "completion-script-bash": "false", "completion-script-zsh": "false", "config.file": "my_cool_prometheus.yaml", "help": "false", "help-long": "false", "help-man": "false", "log.level": "info", "query.lookback-delta": "5m", "query.max-concurrency": "20", "query.timeout": "2m", "storage.tsdb.max-block-duration": "36h", "storage.tsdb.min-block-duration": "2h", "storage.tsdb.no-lockfile": "false", "storage.tsdb.path": "data/", "storage.tsdb.retention": "15d", "version": "false", "web.console.libraries": "console_libraries", "web.console.templates": "consoles", "web.enable-admin-api": "false", "web.enable-lifecycle": "false", "web.external-url": "", "web.listen-address": "0.0.0.0:9090", "web.max-connections": "512", "web.read-timeout": "5m", "web.route-prefix": "/", "web.user-assets": "" } } ``` Signed-off-by: Bartek Plotka <bwplotka@gmail.com>	2018-02-21 08:49:02 +00:00
Fabian Reinartz	7ccd4b39b8	*: implement query params This adds a parameter to the storage selection interface which allows query engine(s) to pass information about the operations surrounding a data selection. This can for example be used by remote storage backends to infer the correct downsampling aggregates that need to be provided.	2018-02-13 12:17:22 +01:00
Conor Broderick	5169ccf258	Merge pull request #3724 from simonpasquier/fix-bad-data-error Don't reset FiredAt for inactive alerts	2018-02-01 16:18:09 +00:00
Krasi Georgiev	b75428ec19	rename package retrieve to scrape no fucnctinal changes just renaming retrieval to scrape	2018-02-01 09:55:07 +00:00
Krasi Georgiev	7858745c04	rename structs for consistency	2018-01-30 17:49:05 +00:00
Krasi Georgiev	acc4197098	remove dicovery race for the context field	2018-01-29 15:18:07 +00:00
Julien Pivotto	8b20cb1e8d	last config success time gauge: use SetToCurrentTime() (#3750 ) Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	2018-01-27 07:48:13 +00:00
Simon Pasquier	81c0ab69e0	Don't reset FiredAt for inactive alerts Otherwise AlertManager receives resolved alerts where StartsAt is zero which fails the validation.	2018-01-22 17:17:33 +01:00
Krasi Georgiev	719c579f7b	refactor main execution reloadReady handling, update some comments	2018-01-17 18:14:24 +00:00
Krasi Georgiev	0eafaf32d3	set the correct config reloading execution for scraper and notifier	2018-01-17 13:06:56 +00:00
Krasi Georgiev	97f0461e29	refactor the config reloading execution	2018-01-17 12:02:13 +00:00
Krasi Georgiev	5260c650ec	use the config hash for the map lookup	2018-01-16 11:10:54 +00:00
Krasi Georgiev	8369826808	comment to rethink the map reference for the notifier discovery	2018-01-16 09:47:53 +00:00
Krasi Georgiev	d12e6f29fc	discovery manager ApplyConfig now takes a direct ServiceDiscoveryConfig so that it can be used for the notify manager reimplement the service discovery for the notify manager Signed-off-by: Krasi Georgiev <krasi.root@gmail.com>	2018-01-15 13:39:44 +00:00
Shubheksha Jalan	0471e64ad1	Use shared types from the `common` repo (#3674 ) * refactor: use shared types from common repo, remove util/config * vendor: add common/config * fix nit	2018-01-11 16:10:25 +01:00
Goutham Veeramachaneni	35a6ffbaf3	Merge pull request #3587 from krasi-georgiev/web-test-error-check handle web_test webhandler errors.	2018-01-10 22:03:25 +05:30
Shubheksha Jalan	ec94df49d4	Refactor SD configuration to remove `config` dependency (#3629 ) * refactor: move targetGroup struct and CheckOverflow() to their own package * refactor: move auth and security related structs to a utility package, fix import error in utility package * refactor: Azure SD, remove SD struct from config * refactor: DNS SD, remove SD struct from config into dns package * refactor: ec2 SD, move SD struct from config into the ec2 package * refactor: file SD, move SD struct from config to file discovery package * refactor: gce, move SD struct from config to gce discovery package * refactor: move HTTPClientConfig and URL into util/config, fix import error in httputil * refactor: consul, move SD struct from config into consul discovery package * refactor: marathon, move SD struct from config into marathon discovery package * refactor: triton, move SD struct from config to triton discovery package, fix test * refactor: zookeeper, move SD structs from config to zookeeper discovery package * refactor: openstack, remove SD struct from config, move into openstack discovery package * refactor: kubernetes, move SD struct from config into kubernetes discovery package * refactor: notifier, use targetgroup package instead of config * refactor: tests for file, marathon, triton SD - use targetgroup package instead of config.TargetGroup * refactor: retrieval, use targetgroup package instead of config.TargetGroup * refactor: storage, use config util package * refactor: discovery manager, use targetgroup package instead of config.TargetGroup * refactor: use HTTPClient and TLS config from configUtil instead of config * refactor: tests, use targetgroup package instead of config.TargetGroup * refactor: fix tagetgroup.Group pointers that were removed by mistake * refactor: openstack, kubernetes: drop prefixes * refactor: remove import aliases forced due to vscode bug * refactor: move main SD struct out of config into discovery/config * refactor: rename configUtil to config_util * refactor: rename yamlUtil to yaml_config * refactor: kubernetes, remove prefixes * refactor: move the TargetGroup package to discovery/ * refactor: fix order of imports	2017-12-29 21:01:34 +01:00
Brian Brazil	ecc24b554d	Hide block duration flags. (#3618 ) Users are starting to use these mistakenly thinking they'll help with issues, and thus causing some confusion. Thus hide them and make it clear that they're only there for testing reasons.	2017-12-24 12:13:48 +00:00
Krasi Georgiev	c94fa731aa	bypass the proxy for the tests	2017-12-20 18:21:10 +00:00
Krasi Georgiev	ad66476c4f	fix flaky main.go test and simplify a bit	2017-12-19 15:07:49 +00:00
Fabian Reinartz	2881d73ed8	Merge pull request #3362 from krasi-georgiev/discovery-refactoring Decouple the discovery and refactor the retrieval package	2017-12-19 12:56:34 +01:00

... 2 3 4 5 6 ...

558 commits