* Add config, HTTP Basic Auth and TLS support to the generic write path.
- Move generic write path configuration to the config file
- Factor out config.TLSConfig -> tls.Config translation
- Support TLSConfig for generic remote storage
- Rename Run to Start, and make it non-blocking.
- Dedupe code in httputil for TLS config.
- Make remote queue metrics global.
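A minimal sketch of the kind of tls.Config translation helper this factors out; the field and function names below are assumptions for illustration, not necessarily the actual httputil API:
```
package httputil

import (
	"crypto/tls"
	"crypto/x509"
	"fmt"
	"io/ioutil"
)

// TLSConfig mirrors the file-based options; field names are illustrative.
type TLSConfig struct {
	CAFile             string
	CertFile, KeyFile  string
	InsecureSkipVerify bool
}

// NewTLSConfig translates the config-file options into a *tls.Config.
func NewTLSConfig(c TLSConfig) (*tls.Config, error) {
	cfg := &tls.Config{InsecureSkipVerify: c.InsecureSkipVerify}
	if c.CAFile != "" {
		pem, err := ioutil.ReadFile(c.CAFile)
		if err != nil {
			return nil, err
		}
		pool := x509.NewCertPool()
		if !pool.AppendCertsFromPEM(pem) {
			return nil, fmt.Errorf("failed to parse CA certificates from %s", c.CAFile)
		}
		cfg.RootCAs = pool
	}
	if c.CertFile != "" {
		cert, err := tls.LoadX509KeyPair(c.CertFile, c.KeyFile)
		if err != nil {
			return nil, err
		}
		cfg.Certificates = []tls.Certificate{cert}
	}
	return cfg, nil
}
```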
Switched to testing via the static_configs parameter rather than
dns_sd_config. Verified that the revised test passes without network
access and still catches the bug it's supposed to cover.
Verify that if the configs change, target groups are cleaned on
TargetManager.reload (rather than having old ones linger around, even if
they are no longer present in the configs).
This covers the bug fixed in #1907 -- I verified that by checking out
source from before that commit.
This is a start on #1906.
Also, remove unused `providers` field in targetSet.
If the config file changes, we recreate all providers (by calling
`providersFromConfig`) and retrieve all targets anew from the newly
created providers. From that perspective, it cannot harm to clean up
the target group map in the targetSet. Not doing so (as was the case
so far) keeps stale targets around. This mattered if an existing
key in the target group map was not overwritten in the initial fetch
of all targets from the providers. Examples where that mattered:
```
scrape_configs:
- job_name: "foo"
  static_configs:
  - targets: ["foo:9090"]
  - targets: ["bar:9090"]
```
updated to:
```
scrape_configs:
- job_name: "foo"
  static_configs:
  - targets: ["foo:9090"]
```
`bar:9090` would still be monitored. (The static provider just
enumerates the target groups. If the number of target groups
decreases, the old ones stay around.)
```
scrape_configs:
- job_name: "foo"
  dns_sd_configs:
  - names:
    - "srv.name.one.example.org"
```
updated to:
```
scrape_configs:
- job_name: "foo"
  dns_sd_configs:
  - names:
    - "srv.name.two.example.org"
```
Now both SRV records are still monitored. The SRV name is part of the
key in the target group map, so the new one is just added and the
old one stays around.
Obviously, this should have tests, and should have had them before,
and not only for this case. This is the quick fix. I have created
https://github.com/prometheus/prometheus/issues/1906 to track test
creation.
Fixes https://github.com/prometheus/prometheus/issues/1610.
0 is considered an invalid interval by time.NewTicker() and will cause a
panic if control reaches that point. Given the vagaries of timekeeping,
this may occasionally happen and make this test unstable.
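A minimal sketch of the guard this implies; the fallback value is an assumption for illustration:
```
package scrape

import "time"

// safeTicker clamps non-positive intervals before calling
// time.NewTicker, which panics when the duration is <= 0.
func safeTicker(d time.Duration) *time.Ticker {
	if d <= 0 {
		d = time.Millisecond // assumed floor; any positive value avoids the panic
	}
	return time.NewTicker(d)
}
```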
Apart from not trying to send a newline in an HTTP header,
this also allows Prometheus to build and pass tests with Go 1.7,
which features stricter checking of HTTP headers.
This adds a `role` field to the Kubernetes SD config, which indicates
which type of Kubernetes SD should be run.
For example, it is no longer possible to discover pods and nodes with
the same SD configuration.
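A hedged sketch of dispatching on the new field; the role names and return type are illustrative, not the actual Kubernetes SD API:
```
package discovery

import "fmt"

// newDiscoverer runs exactly one kind of Kubernetes discovery per SD
// configuration, selected by the new `role` field. The role names
// shown here are illustrative.
func newDiscoverer(role string) (string, error) {
	switch role {
	case "node", "pod", "service", "endpoint":
		return "kubernetes discoverer for role " + role, nil
	default:
		return "", fmt.Errorf("unknown Kubernetes SD role %q", role)
	}
}
```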
This change just signals a scrape target update to the scraping loop
once an initial target set is fetched.
Before, the scrape pool was directly synced, causing a race against an
uninitialized scrape pool.
Fixes #1703
The concurrency applied before is in most cases not even needed. With
a cap=1 channel, most tests are much cleaner.
TestMarathonSDRunAndStop was trickier. It could even have blocked
before.
This also includes a general refactoring of the whole file.
This change deprecates the `target_groups` option in favor
of `static_configs`. The old configuration is still accepted
but prints a warning.
Configuration loading fails if both options are set.
From the documentation and current tests, it wasn't immediately clear to
me whether the `target` being dropped as the result of a 'drop' action
was a label key-value pair or the entire labelset.
Add a test that documents this behaviour.
Documentation: https://prometheus.io/docs/operating/configuration/
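To make the documented behaviour concrete, here is a hedged sketch (not the actual relabel implementation): a matching 'drop' rule discards the entire labelset, and with it the target, rather than just one label pair:
```
package relabel

import "regexp"

// drop sketches the `drop` action: if the selected source label value
// matches the regex, the whole labelset (and with it the target) is
// discarded by returning nil, not just the matching label pair.
func drop(labels map[string]string, sourceLabel string, re *regexp.Regexp) map[string]string {
	if re.MatchString(labels[sourceLabel]) {
		return nil // the entire target is dropped
	}
	return labels
}
```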
So far, out-of-order samples during rule evaluation were not logged,
and neither were scrape health samples. The latter are unlikely to
cause any errors, which is why I'm now logging them unconditionally
(it's always highly irregular should it happen). For rules, I have
used the same plumbing as for samples, just with a different wording
in the message to mark them as a result of rule evaluation, but only
at DEBUG level.
Also, count and report separately the two cases of out-of-order
timestamps on the one hand and same timestamp but different value on
the other.
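A hedged sketch of the two separately counted cases; counter and function names are made up for illustration:
```
package storage

// Counters for the two separately reported conflict cases. The names
// are illustrative, not the actual metric names.
var (
	outOfOrderSamples         int
	duplicateTimestampSamples int
)

// conflict classifies an incoming sample (t, v) against the latest
// stored sample (lastT, lastV) of the same series.
func conflict(t, lastT int64, v, lastV float64) bool {
	switch {
	case t < lastT:
		outOfOrderSamples++ // timestamp went backwards
		return true
	case t == lastT && v != lastV:
		duplicateTimestampSamples++ // same timestamp, different value
		return true
	}
	return false
}
```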
Prometheus is Apache 2 licensed, and most source files have the
appropriate copyright license header, but some were missing it without
apparent reason. Correct that by adding it.
This fixes the case where a target provider closes the update
channel and exits before the context is canceled.
This should only be true for the static provider, but it's safer
to handle this case in general.
This commit simplifies the TargetHealth type and moves the target
status into the target itself. This also removes a race where error
and last scrape time could have been out of sync.
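A hedged sketch of the idea with illustrative names: status lives on the target and all of it changes under one lock, so error and last scrape time stay consistent:
```
package scrape

import (
	"sync"
	"time"
)

type TargetHealth string

const (
	HealthGood TargetHealth = "up"
	HealthBad  TargetHealth = "down"
)

// Target carries its own status; all status fields are updated under
// a single lock so they cannot be observed out of sync.
type Target struct {
	mtx        sync.RWMutex
	health     TargetHealth
	lastError  error
	lastScrape time.Time
}

// report atomically records the outcome of one scrape.
func (t *Target) report(start time.Time, err error) {
	t.mtx.Lock()
	defer t.mtx.Unlock()
	t.lastScrape = start
	t.lastError = err
	if err == nil {
		t.health = HealthGood
	} else {
		t.health = HealthBad
	}
}
```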
This commit removes the scrapeConfig entirely from Target.
All identity-defining parameters are thus immutable now, and the mutex
can be removed.
Target identity is now correctly defined by the labels and the full URL.
This in particular includes URL parameters that are not specified in the
label set.
The fingerprint is also removed from the hash computation to remove an
unnecessarily tight coupling to the common/model package.
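A hedged sketch of what such an identity hash can look like using only the standard library's FNV; the exact fields hashed by the real code may differ:
```
package scrape

import (
	"hash/fnv"
	"sort"
)

// identityHash derives a target's identity from its sorted label set
// plus the full scrape URL, which covers URL parameters that never
// appear as labels. Stdlib FNV replaces the common/model fingerprint.
func identityHash(labels map[string]string, scrapeURL string) uint64 {
	keys := make([]string, 0, len(labels))
	for k := range labels {
		keys = append(keys, k)
	}
	sort.Strings(keys)

	h := fnv.New64a()
	for _, k := range keys {
		h.Write([]byte(k))
		h.Write([]byte(labels[k]))
	}
	h.Write([]byte(scrapeURL))
	return h.Sum64()
}
```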
With this commit the scrape pool deduplicates incoming
targets before scraping them. This way multiple target providers
can produce the same target but it will be scraped only once.
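A minimal sketch of such deduplication, assuming each target carries the identity hash from the previous sketch:
```
package scrape

// Target is stubbed here with a precomputed identity hash.
type Target struct{ hash uint64 }

// dedupe keeps one target per identity hash, so the same target
// emitted by several providers is scraped only once.
func dedupe(targets []Target) []Target {
	seen := make(map[uint64]struct{}, len(targets))
	var out []Target
	for _, t := range targets {
		if _, ok := seen[t.hash]; ok {
			continue
		}
		seen[t.hash] = struct{}{}
		out = append(out, t)
	}
	return out
}
```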
This commit updates a target set's scrape configuration
on reload. This will cause all running scrape loops to be
stopped and started again with new parameters.
This commit changes the scraper interface to accept a timestamp
so the timestamp reported by the caller and the timestamp attached
to samples do not differ.
This commit moves Scraper handling into a separate scrapePool type.
TargetSets only manage TargetProvider lifecycles and sync the
retrieved updates to the scrapePool.
TargetProviders are now expected to send a full initial target set
within 5 seconds. The scrapePools preserve target state across reloads
and only drop targets after the initial set was synced.
We group providers by their scrape configuration. Each provider produces
target groups with a unique identifier.
On stopping a set of target providers, we cancel the target providers,
stop scraping the targets, and wait for the scrapers to finish.
On configuration reload, all provider sets are stopped and new ones
are created. This will make targets disappear briefly on configuration
reload. Scrapes are potentially missed, but due to the consistent
scrape intervals implemented recently, the impact is minor.
Double acquisition of the RLock usually doesn't blow up, but if the
write lock is requested between the two RLocks, we are deadlocked.
This deadlock does not exist in release-0.17, BTW.
With recent changes to a Target's internal data representation,
updating via fullLabels() assigns the additional default instance
label. This breaks target identity comparison and causes identical
targets from service discovery to be constantly swapped.
So far we were using the InstanceIdentifier to compare equality of targets.
This is not always accurate, for example for the blackbox exporter where the
actual target is in the parameter.
For historic reasons we were enforcing a timeout directly
via the TCP dialer. This has not been necessary for quite a while now.
Switching to context.Context will allow us to properly terminate
requests on shutdown as well.
To evenly distribute scraping load we currently rely on random
jittering. This commit hashes over the target's identity and calculates
a consistent offset. This also ensures that scrape intervals
stay consistently spaced across config/target changes.
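A hedged sketch of how such a hash-derived offset can work; the real code may differ in how it aligns to wall-clock time:
```
package scrape

import "time"

// offset maps a target's identity hash to a fixed point within the
// scrape interval (which must be > 0), replacing random jitter: the
// same target always scrapes at the same phase, and distinct targets
// spread out evenly.
func offset(interval time.Duration, identityHash uint64) time.Duration {
	return time.Duration(identityHash % uint64(interval))
}

// nextScrapeIn aligns scrapes to the interval grid plus the per-target
// offset, so spacing survives restarts and reloads.
func nextScrapeIn(now time.Time, interval time.Duration, identityHash uint64) time.Duration {
	base := interval - time.Duration(now.UnixNano())%interval
	next := base + offset(interval, identityHash)
	if next > interval {
		next -= interval
	}
	return next
}
```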
This gives up on the idea to communicate through the Append() call (by
either not returning as it is now or returning an error as
suggested/explored elsewhere). Here I have added a Throttled() call,
which has the advantage that it can be called before a whole _batch_
of Append()'s. Scrapes will happen completely or not at all. Same for
rule group evaluations. That's a highly desired behavior (as discussed
elsewhere). The code is even simpler now as the whole ingestion buffer
could be removed.
Logging of throttled mode has been streamlined and will create at most
one message per minute.
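A hedged sketch of the described contract; the Throttled() name follows this description, the rest is illustrative:
```
package storage

// Sample is a placeholder for a scraped sample.
type Sample struct {
	Name  string
	Value float64
}

// SampleAppender sketches the described contract: Throttled() can be
// checked once before a whole batch of Append()s.
type SampleAppender interface {
	Append(Sample)
	Throttled() bool
}

// ingestScrape applies the all-or-nothing rule: a throttled store
// skips the entire scrape instead of ingesting it partially.
func ingestScrape(app SampleAppender, samples []Sample) bool {
	if app.Throttled() {
		return false // whole scrape dropped, not individual samples
	}
	for _, s := range samples {
		app.Append(s)
	}
	return true
}
```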
Duration parsing is actually happening in several places (and for
flags, we use the standard Go time.Duration...). This at least reduces
all our home-grown parsing to one place (in model).
nerve's registration format differs from serverset. With this commit
there is now a dedicated treecache file in util,
and two separate files for serverset and nerve.
Reference:
https://github.com/airbnb/nerve
For the SNMP and blackbox exporters, where the port tends not to be
80/443 and indeed there may not be a port at all, this makes the
relabelling a bit simpler, as you don't have to figure out that this
logic exists and strip off the :80.
This is a breaking change for the example configs of
those exporters.
With the blackbox exporter, the instance label will commonly
be used for things other than hostnames so remove this restriction.
https://example.com or https://example.com/probe/me are some examples.
To prevent user error, check that URLs aren't provided as targets
when there's no relabelling that could potentially fix them.
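A hedged sketch of such a check, with illustrative names:
```
package config

import (
	"fmt"
	"strings"
)

// checkStaticTargets is an illustrative validation: a scheme-prefixed
// target is only plausible when relabelling rules exist that might
// rewrite it into a valid address.
func checkStaticTargets(targets []string, hasRelabelRules bool) error {
	for _, t := range targets {
		if strings.Contains(t, "://") && !hasRelabelRules {
			return fmt.Errorf("target %q is a URL but no relabelling is configured to fix it", t)
		}
	}
	return nil
}
```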
1. Static credentials replaced with defaults.DefaultChainCredentials.
This change ensures that credentials are sourced from all possible
providers available with the AWS SDK, in the following order:
env variables, the shared AWS config file in the user's folder, and
the EC2 instance role.
2. Added a few labels: AvailabilityZone, PublicDns, VpcId (if
available), SubnetId (if in a VPC).
The prefixed target provider changed a pointerized target group that was
reused in the wrapped target provider, causing an ever-increasing chain
of source prefixes in target groups from the Consul target provider.
We now make this bug generally impossible by switching the target group
channel from pointer to value type, thus ensuring that target groups
are copied before being passed on to other parts of the system.
I tried not to let the depointerization leak too far outside of the
channel handling (both upstream and downstream), because I tried that
initially and it caused some nasty bugs, which I want to minimize.
Fixes https://github.com/prometheus/prometheus/issues/1083
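To illustrate the failure mode, here is a self-contained demo with an illustrative TargetGroup type: a pointer sent on a channel can still be mutated by the sender, while a value is copied at send time:
```
package main

import "fmt"

type TargetGroup struct{ Source string }

func main() {
	// Pointer channel: prefixing after the send leaks into the group
	// the consumer already "received".
	byPtr := make(chan *TargetGroup, 1)
	g := &TargetGroup{Source: "svc"}
	byPtr <- g
	g.Source = "consul/" + g.Source
	fmt.Println((<-byPtr).Source) // "consul/svc": mutated after send

	// Value channel: each send copies the group, so later mutation by
	// the producer cannot corrupt what was already sent.
	byVal := make(chan TargetGroup, 1)
	v := TargetGroup{Source: "svc"}
	byVal <- v
	v.Source = "consul/" + v.Source
	fmt.Println((<-byVal).Source) // "svc": unaffected
}
```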
Bump timeouts of tests where we don't want I/O timeouts.
Adjust the full channel test to be much more reliable,
by reducing the ingestion timeout from 1ms to 0.
Move defer resp.Body.Close() up to make sure it's called even when the
HTTP request returns something other than 200 or Decoder construction
fails. This avoids leaking and eventually running out of file descriptors.
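A hedged sketch of the resulting pattern (function name and URL handling are illustrative):
```
package retrieval

import (
	"fmt"
	"net/http"
)

// fetch closes the body on every path: deferring right after a
// successful request means non-200 responses and decoder-construction
// failures no longer leak file descriptors.
func fetch(url string) error {
	resp, err := http.Get(url)
	if err != nil {
		return err
	}
	defer resp.Body.Close() // before any status or decoder checks

	if resp.StatusCode != http.StatusOK {
		return fmt.Errorf("unexpected status %s", resp.Status)
	}
	// ... construct the decoder and consume resp.Body here ...
	return nil
}
```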