prometheus

mirror of https://github.com/prometheus/prometheus.git synced 2025-03-05 20:59:13 -08:00

Author	SHA1	Message	Date
Julien Pivotto	ef4d8a38ca	Change metrics relabel terminology (#7362 ) Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	2020-06-09 05:40:45 +01:00
Julien Pivotto	2209fa98b4	Fix consul_sd_config to follow types convention (#7316 ) Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	2020-05-29 21:20:37 +02:00
Jop Zinkweg	1f69c38ba4	Add discovery support for triton compute nodes (#7250 ) Added optional configuration item role, defaults to 'container' (backwards-compatible). Setting role to 'cn' will discover compute nodes instead. Human-friendly compute node hostname discovery depends on cmon 1.7.0: `c1a2aeca36` Adjust testcases to use discovery config per case as two different types are now supported. Updated documentation: * new role setting * clarify what the name 'container' covers as triton uses different names in different locations Signed-off-by: jzinkweg <jzinkweg@gmail.com>	2020-05-22 16:19:21 +01:00
Harold Dost	18d45e564b	Documentation: Update example expressions to follow convention. (#7195 ) Based out of conversation on #7193 Signed-off-by: Harold Dost <h.dost@criteo.com>	2020-05-02 12:52:24 +01:00
Callum Styan	386aea7774	Add missing remote write/read config name to docs. (#7105 ) Signed-off-by: Callum Styan <callumstyan@gmail.com>	2020-04-14 09:27:33 -07:00
Frederic Hemberger	fe47c9c86e	[Docs] consul_sd_config: Add default value for `allow_stale` (#7075 ) Ref: https://github.com/prometheus/prometheus/blob/master/discovery/consul/consul.go#L97 Signed-off-by: Frederic Hemberger <mail@frederic-hemberger.de>	2020-03-31 18:55:25 +01:00
Deepjyoti Mondal	c38ca2ca95	Fix #6999 : Add architecture meta label for EC2 (#7000 ) This PR adds architecture meta labels for EC2 instances Signed-off-by: Deepjyoti Mondal <djmdeveloper060796@gmail.com>	2020-03-28 20:41:37 +00:00
Brian Brazil	445d48f4ce	Fix small docs typo (#7014 ) Signed-off-by: Brian Brazil <brian.brazil@robustperception.io>	2020-03-20 12:11:32 +01:00
coding3min	4dfbf328f2	[OpenStack SD] Add HypervisorID meta labels about id (#6962 ) Add extra meta labels which will be useful in the case Prometheus discovery hypervisor . Signed-off-by: pzqu <pzqu@qq.com> Co-authored-by: pzqu <pzqu@example.com>	2020-03-11 08:38:14 +00:00
Alex Gaganov	df92a00838	Expose EC2 instance lifecycle as label (#6914 ) Signed-off-by: Alex Gaganov <alex.gaganov@fiverr.com>	2020-03-03 08:03:16 +00:00
李国忠	029b45aa30	add service type metadata to kubernetes_sd_config service role #6496 (#6684 ) * [service discovery] add service type metadata to kubernetes_sd_config service role Signed-off-by: fuling <fuling.lgz@alibaba-inc.com> * [fix] ServiceType -> string Signed-off-by: fuling <fuling.lgz@alibaba-inc.com> * [fix] fix testcase Signed-off-by: fuling <fuling.lgz@alibaba-inc.com> * [style] Signed-off-by: fuling <fuling.lgz@alibaba-inc.com> * [doc] add service type Signed-off-by: fuling <fuling.lgz@alibaba-inc.com> * [doc] sort Signed-off-by: fuling <fuling.lgz@alibaba-inc.com>	2020-02-25 09:22:14 +01:00
Aleksandra Gacek	8e53c19f9c	discovery/kubernetes: expose label_selector and field_selector Close #6807 Co-authored-by @shuttie Signed-off-by: Aleksandra Gacek <algacek@google.com>	2020-02-15 14:57:56 +01:00
Grebennikov Roman	b4445ff03f	discovery/kubernetes: expose label_selector and field_selector Closes #6096 Signed-off-by: Grebennikov Roman <grv@dfdx.me>	2020-02-15 14:57:38 +01:00
Andrew Hayworth	a336908678	Adds link to valid metric names (#6774 ) One of our users today asked us if dashes were allowed in recording rule names. We asserted that they were not, but also that we could not remember for certain. After determining empirically that they are _not_ allowed, I realized that the documentation could be slightly clearer about valid rule names. This PR simply adds a note to the documentation re-iterating that the rules must be valid metric names - and more importantly, adds a link to where a user can read what those are, in case they were not aware (or did not know where to find it). Signed-off-by: Andrew Hayworth <ahayworth@gmail.com>	2020-02-07 07:32:15 +00:00
Julien Pivotto	9d9bc524e5	Add query log (#6520 ) * Add query log, make stats logged in JSON like in the API Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	2020-01-08 13:28:43 +00:00
Kyle Hinton	16f1e252f4	Small grammar fix on alerting rules doc (#6104 ) Signed-off-by: Kyle Hinton <kyle.hinton0@gmail.com>	2019-10-07 10:17:36 +02:00
Simon Pasquier	f6f23a2675	docs: update unit testing rules (#6051 ) * docs: update unit testing rules Signed-off-by: Simon Pasquier <spasquie@redhat.com> * More nits fixed Signed-off-by: Simon Pasquier <spasquie@redhat.com>	2019-09-25 09:26:53 +02:00
li mengyang	1c6d2194c4	fix spelling mistakes in docs (#5952 ) Signed-off-by: hwdef <hwdef97@gmail.com>	2019-08-27 11:33:40 -06:00
Chris Marchbanks	a6a55c433c	Improve desired shards calculation (#5763 ) The desired shards calculation now properly keeps track of the rate of pending samples, and uses the previously unused integralAccumulator to adjust for missing information in the desired shards calculation. Also, configure more capacity for each shard. The default 10 capacity causes shards to block on each other while sending remote requests. Default to a 500 sample capacity and explain in the documentation that having more capacity will help throughput. Signed-off-by: Chris Marchbanks <csmarchbanks@gmail.com>	2019-08-13 10:10:21 +01:00
Dan P	a9dea68ee6	removed document reference to meta label that doesnt exist in the kubernetes_sd (#5821 ) Signed-off-by: Dan Potepa <dan@danpotepa.co.uk>	2019-08-01 12:34:23 +01:00
beorn7	5973acd65d	Clarifying honor_labels documentation Previously, the wording could be misunderstood as setting honor_labels to "false" for federation. This also adds scraping the Pushgateway as a typical use case for honor_labels=true. Signed-off-by: beorn7 <beorn@grafana.com>	2019-07-02 13:23:20 +02:00
Svend Sorensen	8d54650d06	Document behavior of empty ec2_sd_config region (#5711 ) Document the behavior of an empty `ec2_sd_config` `region` setting. If this is omitted or blank, the region is discovered from the instance metadata, if available. If it is blank and instance region metadata is not available, an error will result ("EC2 SD configuration requires a region"). Signed-off-by: Svend Sorensen <svend@svends.net>	2019-06-27 18:35:54 +01:00
Max Leonard Inden	41c22effbe	config&notifier: Add option to use Alertmanager API v2 With v0.16.0 Alertmanager introduced a new API (v2). This patch adds a configuration option for Prometheus to send alerts to the v2 endpoint instead of the defautl v1 endpoint. Signed-off-by: Max Leonard Inden <IndenML@gmail.com>	2019-06-21 16:33:53 +02:00
Björn Rabenstein	dc22f74153	Merge pull request #5608 from simonpasquier/external-labels-for-alert-tests cmd/promtool: add $externalLabels for alert unit tests	2019-06-20 16:48:12 +02:00
Björn Rabenstein	f3f016d464	Merge pull request #5604 from cstyan/default-capacity-docs Update queue config documentation	2019-06-17 13:05:14 +02:00
Ganesh Vernekar	5888066ffa	Merge pull request #5649 from cstyan/remove-queue-retries Remove max_retries from queue_config	2019-06-17 12:47:16 +05:30
Jens Erat	375aeb9158	Added humanizePercentage formatting to templates (#5670 ) Lots of alerts are based on ratios (eg. disk usage), and humans are used to values in percentage in textual descriptions. Signed-off-by: Jens Erat <email@jenserat.de>	2019-06-15 08:59:57 +01:00
Keenan Romain	55f3a9fe4a	Allows globs for rules when unit testing (#5595 ) * Includes glob support when unit testing rule_files. Signed-off-by: Keenan Romain <Keenan.Romain@mailchimp.com>	2019-06-12 11:31:07 +01:00
Callum Styan	e9129abeff	Remove max_retries from queue_config since it's not used in remote write anymore. Signed-off-by: Callum Styan <callumstyan@gmail.com>	2019-06-10 12:43:08 -07:00
Frederic Branczyk	9fc3c61e2c	Merge pull request #5598 from sh0rez/master include InitContainers in Kubernetes Service Discovery	2019-06-05 18:47:13 +02:00
Simon Pasquier	74ff35ccdd	cmd/promtool: add $externalLabels for alert unit tests Signed-off-by: Simon Pasquier <spasquie@redhat.com>	2019-05-29 16:40:01 +02:00
sh0rez	8ba23fb336	fix(style): container_is_init to container_init Removes 'is' keyword to comply style guide Signed-off-by: sh0rez <me@shorez.de>	2019-05-29 16:16:19 +02:00
Carl Bergquist	9ba2f13c5e	fix inconsistant example rule (#5605 ) Signed-off-by: bergquist <carl.bergquist@gmail.com>	2019-05-29 10:46:00 +01:00
sh0rez	88b79bae64	chore(style): Comply with style guide, order list Signed-off-by: sh0rez <me@shorez.de>	2019-05-29 11:22:10 +02:00
Callum Styan	babb8a0572	Update queue config documentation to reflect default value change for capacity. Signed-off-by: Callum Styan <callumstyan@gmail.com>	2019-05-28 14:12:57 -07:00
sh0rez	1b144e499f	doc(discovery/kubernetes): container_is_init meta label Signed-off-by: sh0rez <me@shorez.de>	2019-05-28 16:52:13 +02:00
Bevisy	bdebb0c890	format markdown code block (#5594 ) Signed-off-by: bevisy <binbin36520@gmail.com>	2019-05-25 11:28:50 +01:00
Frederic Branczyk	04f22700b7	Merge pull request #5571 from simonpasquier/extend-k8s-endpoint-metadata discovery/kubernetes: add node name and hostname to endpoints	2019-05-16 20:19:29 +02:00
Samuel Alfageme	425b07f3c4	Updated the 'consistency-modes' consul.io/api link to point to its new location (#5572 ) Ref: `626392eb62` Signed-off-by: Samuel Alfageme <samuel@alfage.me>	2019-05-16 15:52:35 +01:00
Simon Pasquier	3441ecdea1	discovery/kubernetes: add node name and hostname to endpoints Signed-off-by: Simon Pasquier <spasquie@redhat.com>	2019-05-16 10:49:13 +02:00
Simon Pasquier	9c69eec82a	cmd/promtool: use log.NewNopLogger() (#5531 ) Signed-off-by: Simon Pasquier <spasquie@redhat.com>	2019-05-03 10:00:07 +01:00
Björn Rabenstein	0be9388f8d	Merge pull request #5463 from prometheus/beorn7/templating Follow-up on #5009	2019-04-24 16:42:23 +02:00
EarthmanT	35be8c9e25	Add azure public ip label (#5475 ) * Update Azure SD Config with Public IP label Signed-off-by: earthmant <trammell@cloudify.co>	2019-04-17 16:05:44 +01:00
Bjoern Rabenstein	38d518c0fe	Rework #5009 after comments Signed-off-by: Bjoern Rabenstein <bjoern@rabenste.in>	2019-04-17 01:40:10 +02:00
Sylvain Rabot	335a34486e	Add external labels to template expansion This affects the expansion of templates in alert labels and annotations and console templates. Signed-off-by: Sylvain Rabot <sylvain@abstraction.fr>	2019-04-17 01:40:10 +02:00
Simon Pasquier	dafd1632a2	discovery/kubernetes: add present labels for labels/annotations (#5443 ) Signed-off-by: Simon Pasquier <spasquie@redhat.com>	2019-04-10 13:21:42 +01:00
Kien Nguyen-Tuan	813b58367a	[OpenStack SD] Add ProjectID and UserID meta labels (#5431 ) Add extra meta labels which will be useful in the case Prometheus discovery instances from all projects. Signed-off-by: Kien Nguyen <kiennt2609@gmail.com>	2019-04-04 10:02:31 +01:00
Julien Pivotto	4397916cb2	Add honor_timestamps (#5304 ) Fixes #5302 Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	2019-03-15 10:04:15 +00:00
Callum Styan	83c46fd549	update Consul vendor code so that catalog.ServiceMultipleTags can be (#5151 ) Signed-off-by: Callum Styan <callumstyan@gmail.com>	2019-03-12 10:31:27 +00:00
tuanvcw	9de0ab3c8a	Update remaining deprecated links in docs (#5271 ) Signed-off-by: Vu Cong Tuan <tuanvc@vn.fujitsu.com>	2019-02-26 10:16:38 +00:00
LongKB	e4a741cb7d	Replacing 'HTTP' by 'HTTPS' for securing links (#5252 ) Currently, when we access the modified pages with HTTP, it is redirected to HTTPS automatically. So this commit aims to replace HTTP to HTTPs for security. Co-Authored-By: Nguyen Phuong An <AnNP@vn.fujitsu.com> Signed-off-by: Kim Bao Long <longkb@vn.fujitsu.com>	2019-02-22 14:33:02 +01:00
LongKB	23480bef43	Remove the duplicated words (#5251 ) Although it is spelling mistakes, it might make an affects while reading. Co-Authored-By: Nguyen Phuong An <AnNP@vn.fujitsu.com> Signed-off-by: Kim Bao Long <longkb@vn.fujitsu.com>	2019-02-22 14:32:34 +01:00
Simon Pasquier	c8a1a5a93c	discovery/kubernetes: fix support for password_file and bearer_token_file (#5211 ) * discovery/kubernetes: fix support for password_file Signed-off-by: Simon Pasquier <spasquie@redhat.com> * Create and pass custom RoundTripper to Kubernetes client Signed-off-by: Simon Pasquier <spasquie@redhat.com> * Use inline HTTPClientConfig Signed-off-by: Simon Pasquier <spasquie@redhat.com>	2019-02-20 11:22:34 +01:00
Kevin Bulebush	718344434c	openstack_sd: Supporting application credential for authentication. (#4968 ) * openstack_sd: Support application credentials for authentication. Updated gophercloud Signed-off-by: Kevin Bulebush <kmbulebu@gmail.com>	2019-01-09 15:18:58 +00:00
Fabian Reinartz	93d13c59d0	Sort Signed-off-by: Fabian Reinartz <freinartz@google.com>	2019-01-03 13:10:57 +01:00
Fabian Reinartz	7a41038695	Add Azure tenant and subscription ID labels Signed-off-by: Fabian Reinartz <freinartz@google.com>	2019-01-03 13:09:13 +01:00
Julien Pivotto	2e725a195a	Niptick about relabel config (#4994 ) Signed-off-by: Julien Pivotto <roidelapluie@inuits.eu>	2018-12-21 10:42:18 +00:00
Marcel D. Juhnke	c7d83b2b6a	discovery: add support for Managed Identity authentication in Azure SD (#4590 ) Signed-off-by: Marcel Juhnke <marrat@marrat.de>	2018-12-19 10:03:33 +00:00
Tariq Ibrahim	de6f3b6af7	expose kubernetes service cluster ip (#4940 ) Signed-off-by: tariqibrahim <tariq.ibrahim@microsoft.com> Signed-off-by: tariqibrahim <tariq181290@gmail.com>	2018-12-18 15:17:34 +00:00
Samuel Alfageme	240321acee	Add taggedAddress to the labels in ConsulSD (#5001 ) Useful when multiple (tagged) addresses for a node are exposed on the catalog API Ref. https://www.consul.io/api/catalog.html#taggedaddresses Signed-off-by: Samuel Alfageme <samuel@alfage.me>	2018-12-18 11:51:05 +01:00
Tariq Ibrahim	e3bdc463fa	Revert "add logic to check if an azure VM is deallocated or not (#4908 )" (#4980 ) This reverts commit `61cf4365` Signed-off-by: tariqibrahim <tariq.ibrahim@microsoft.com>	2018-12-12 09:27:12 +01:00
Ryota Arai	135d580ab2	Introduce min_shards for remote write to set minimum number of shards. (#4924 ) Signed-off-by: Ryota Arai <ryota.arai@gmail.com>	2018-12-04 17:32:14 +00:00
Tariq Ibrahim	61cf4365d6	add logic to check if an azure VM is deallocated or not (#4908 ) * add logic to check if an azure VM is deallocated or not * update documentation with the new azure power state label Signed-off-by: tariqibrahim <tariq.ibrahim@microsoft.com>	2018-11-30 11:32:40 +00:00
Serghei Anicheev	8e659a5109	Adding private_dns_name to the list of ec2 labels which can be used i… (#4693 ) * Adding private_dns_name to the list of ec2 labels which can be used in node naming for dynamic environments Signed-off-by: Serghei Anicheev <serghei@rentalcover.com>	2018-11-30 11:11:06 +00:00
Bryan Boreham	cf37e1feb4	Add __meta_kubernetes_pod_phase label in discovery (#4824 ) This lets you add a relabel rule to drop scrapes for pods which are not running. Signed-off-by: Bryan Boreham <bjboreham@gmail.com>	2018-11-06 14:40:24 +00:00
Silvio Gissi	6100f160ad	EC2 Platform meta label (#4663 ) Set __meta_ec2_platform label with the instance platform string. Set to 'windows' on Windows servers and absent otherwise. Signed-off-by: Silvio Gissi <silvio@gissilabs.com>	2018-11-06 14:39:48 +00:00
Timo Beckers	36143be234	docs - refer to documentation/examples/prometheus-marathon.yml Signed-off-by: Timo Beckers <timo@incline.eu>	2018-10-25 18:02:59 +02:00
Kien Nguyen-Tuan	9c5370fdfe	Support discover instances from all projects (#4682 ) By default, OpenStack SD only queries for instances from specified project. To discover instances from other projects, users have to add more openstack_sd_configs for each project. This patch adds `all_tenants` <bool> options to openstack_sd_configs. For example: - job_name: 'openstack_all_instances' openstack_sd_configs: - role: instance region: RegionOne identity_endpoint: http://<identity_server>/identity/v3 username: <username> password: <super_secret_password> domain_name: Default all_tenants: true Co-authored-by: Kien Nguyen <kiennt2609@gmail.com> Signed-off-by: dmatosl <danielmatos.lima@gmail.com>	2018-10-17 13:01:33 +01:00
Brian Brazil	468e49417c	Update remote_write queue docs to present defaults. (#4715 ) Signed-off-by: Brian Brazil <brian.brazil@robustperception.io>	2018-10-10 18:51:27 +01:00
Richard Kiene	b537f6047a	Add ability to filter triton_sd targets by pre-defined groups (#4701 ) Additionally, add triton groups metadata to the discovery reponse and correct a documentation error regarding the triton server id metadata. Signed-off-by: Richard Kiene <richard.kiene@joyent.com>	2018-10-10 10:03:34 +01:00
Simon Pasquier	a2a78d0a09	discovery/openstack: discover all interfaces (#4649 ) * discovery/openstack: discover all interfaces * Add address pool label Signed-off-by: Simon Pasquier <spasquie@redhat.com>	2018-10-09 16:17:08 +01:00
Simon Pasquier	e1e2821cca	Merge pull request #4654 from simonpasquier/openstack-tls discovery/openstack: support tls_config	2018-10-05 18:11:55 +02:00
Ganesh Vernekar	420c0f5e46	Fix docs (#4690 ) Signed-off-by: Ganesh Vernekar <cs15btech11018@iith.ac.in>	2018-10-03 12:45:09 +01:00
Ganesh Vernekar	5790d23fd8	Unit testing for rules (#4350 ) * Unit testing for rules * Specifying order of group evaluation in unit tests Signed-off-by: Ganesh Vernekar <cs15btech11018@iith.ac.in>	2018-09-25 17:06:26 +01:00
Simon Pasquier	ff08c40091	discovery/openstack: support tls_config Signed-off-by: Simon Pasquier <spasquie@redhat.com>	2018-09-25 14:31:32 +02:00
Tariq Ibrahim	f708fd5c99	Adding support for multiple azure environments (#4569 ) Signed-off-by: Tariq Ibrahim <tariq.ibrahim@microsoft.com>	2018-09-04 17:55:40 +02:00
Javier Kohen	1c89984778	Expose EC2 instance owner as a discovery label. This exposes the OwnerID field of the DescribeInstances respons as . Signed-off-by: Javier Kohen <jkohen@google.com>	2018-08-17 11:30:18 -04:00
Javier Kohen	2d4bcb3ee1	Document the new __meta_gce_instance_id discovery label. Signed-off-by: Javier Kohen <jkohen@google.com>	2018-08-10 11:59:22 -04:00
Johannes Scheuermann	7608ee87d0	Inital support for Azure VMSS (#4202 ) * Inital support for Azure VMSS Signed-off-by: Johannes Scheuermann <johannes.scheuermann@inovex.de> * Add documentation for the newly introduced label Signed-off-by: Johannes M. Scheuermann <joh.scheuer@gmail.com>	2018-08-01 12:52:21 +01:00
José Martínez	791c13b142	discovery/ec2: Add primary_subnet_id label Signed-off-by: José Martínez <xosemp@gmail.com>	2018-07-25 09:20:58 +01:00
Jannick Fahlbusch ฏ๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎๎	0be25f92e2	EC2 Discovery: Allow to set a custom endpoint (#4333 ) Allowing to set a custom endpoint makes it easy to monitor targets on non AWS providers with EC2 compliant APIs. Signed-off-by: Jannick Fahlbusch <git@jf-projects.de>	2018-07-18 10:48:14 +01:00
Romain Baugue	b41be4ef52	Discovery consul service meta (#4280 ) * Upgrade Consul client * Add ServiceMeta to the labels in ConsulSD Signed-off-by: Romain Baugue <romain.baugue@elwinar.com>	2018-07-18 05:06:56 +01:00
Simon Pasquier	ed99af0b05	docs: fix OpenStack SD for the hypervisor role Signed-off-by: Simon Pasquier <spasquie@redhat.com>	2018-07-15 12:37:57 +01:00
Marcin Owsiany	9fe8bcf4be	Fix markup in example. (#4351 ) Signed-off-by: Marcin Owsiany <marcin@owsiany.pl>	2018-07-05 09:13:00 +01:00
Damien Lespiau	e64037053d	Expose controller kind and name to labelling rules Relabelling rules can use this information to attach the name of the controller that has created a pod. In turn, this can be used to slice metrics by workload at query time, ie. "Give me all metrics that have been created by the $name Deployment" Signed-off-by: Damien Lespiau <damien@weave.works>	2018-05-09 11:51:37 +02:00
Nathan Graves	5b27996cb3	Include GCE labels during service discovery. Updated vendor files for Google API. (#4150 ) Signed-off-by: Nathan Graves <nathan.graves@kofile.us>	2018-05-08 17:37:47 +01:00
Daisy T	b424eb42e3	document remote write queue parameters (#4126 )	2018-04-30 20:08:45 +02:00
Brian Brazil	fbe66819c5	Update ALERTS docs for 2.0 staleness changes. (#4116 ) Signed-off-by: Brian Brazil <brian.brazil@robustperception.io>	2018-04-26 12:44:11 +01:00
Adam Shannon	809881d7f5	support reading basic_auth password_file for HTTP basic auth (#4077 ) Issue: https://github.com/prometheus/prometheus/issues/4076 Signed-off-by: Adam Shannon <adamkshannon@gmail.com>	2018-04-25 18:19:06 +01:00
Philippe Laflamme	2aba238f31	Use common HTTPClientConfig for marathon_sd configuration (#4009 ) This adds support for basic authentication which closes #3090 The support for specifying the client timeout was removed as discussed in https://github.com/prometheus/common/pull/123. Marathon was the only sd mechanism doing this and configuring the timeout is done through `Context`. DC/OS uses a custom `Authorization` header for authenticating. This adds 2 new configuration properties to reflect this. Existing configuration files that use the bearer token will no longer work. More work is required to make this backwards compatible.	2018-04-05 09:08:18 +01:00
albatross0	0245fd55bf	Add a machine type label to GCE SD (#4032 )	2018-03-31 09:20:19 +01:00
Kristiyan Nikolov	be85ba3842	discovery/ec2: Support filtering instances in discovery (#4011 )	2018-03-31 07:51:11 +01:00
Corentin Chary	60dafd425c	consul: improve consul service discovery (#3814 ) * consul: improve consul service discovery Related to #3711 - Add the ability to filter by tag and node-meta in an efficient way (`/catalog/services` allow filtering by node-meta, and returns a `map[string]string` or `service`->`tags`). Tags and nore-meta are also used in `/catalog/service` requests. - Do not require a call to the catalog if services are specified by name. This is important because on large cluster `/catalog/services` changes all the time. - Add `allow_stale` configuration option to do stale reads. Non-stale reads can be costly, even more when you are doing them to a remote datacenter with 10k+ targets over WAN (which is common for federation). - Add `refresh_interval` to minimize the strain on the catalog and on the service endpoint. This is needed because of that kind of behavior from consul: https://github.com/hashicorp/consul/issues/3712 and because a catalog on a large cluster would basically change all the time. No need to discover targets in 1sec if we scrape them every minute. - Added plenty of unit tests. Benchmarks ---------- ```yaml scrape_configs: - job_name: prometheus scrape_interval: 60s static_configs: - targets: ["127.0.0.1:9090"] - job_name: "observability-by-tag" scrape_interval: "60s" metrics_path: "/metrics" consul_sd_configs: - server: consul.service.par.consul.prod.crto.in:8500 tag: marathon-user-observability # Used in After refresh_interval: 30s # Used in After+delay relabel_configs: - source_labels: [__meta_consul_tags] regex: ^(.,)?marathon-user-observability(,.)?$ action: keep - job_name: "observability-by-name" scrape_interval: "60s" metrics_path: "/metrics" consul_sd_configs: - server: consul.service.par.consul.prod.crto.in:8500 services: - observability-cerebro - observability-portal-web - job_name: "fake-fake-fake" scrape_interval: "15s" metrics_path: "/metrics" consul_sd_configs: - server: consul.service.par.consul.prod.crto.in:8500 services: - fake-fake-fake ``` Note: tested with ~1200 services, ~5000 nodes. \| Resource \| Empty \| Before \| After \| After + delay \| \| -------- \|:-----:\|:------:\|:-----:\|:-------------:\| \|/service-discovery size\|5K\|85MiB\|27k\|27k\|27k\| \|`go_memstats_heap_objects`\|100k\|1M\|120k\|110k\| \|`go_memstats_heap_alloc_bytes`\|24MB\|150MB\|28MB\|27MB\| \|`rate(go_memstats_alloc_bytes_total[5m])`\|0.2MB/s\|28MB/s\|2MB/s\|0.3MB/s\| \|`rate(process_cpu_seconds_total[5m])`\|0.1%\|15%\|2%\|0.01%\| \|`process_open_fds`\|16\|1236\|22\|22\| \|`rate(prometheus_sd_consul_rpc_duration_seconds_count{call="services"}[5m])`\|~0\|1\|1\|0.03\| \|`rate(prometheus_sd_consul_rpc_duration_seconds_count{call="service"}[5m])`\|0.1\|80\|0.5\|0.5\| \|`prometheus_target_sync_length_seconds{quantile="0.9",scrape_job="observability-by-tag"}`\|N/A\|200ms\|0.2ms\|0.2ms\| \|Network bandwidth\|~10kbps\|~2.8Mbps\|~1.6Mbps\|~10kbps\| Filtering by tag using relabel_configs uses 100kiB and 23kiB/s per service per job and quite a lot of CPU. Also sends and additional 1Mbps of traffic to consul. Being a little bit smarter about this reduces the overhead quite a lot. Limiting the number of `/catalog/services` queries per second almost removes the overhead of service discovery. * consul: tweak `refresh_interval` behavior `refresh_interval` now does what is advertised in the documentation, there won't be more that one update per `refresh_interval`. It now defaults to 30s (which was also the current waitTime in the consul query). This also make sure we don't wait another 30s if we already waited 29s in the blocking call by substracting the number of elapsed seconds. Hopefully this will do what people expect it does and will be safer for existing consul infrastructures.	2018-03-23 14:48:43 +00:00
Yecheng Fu	56ed29fbf7	Map target infos of endpoints to prometheus meta labels. (#3770 )	2018-03-09 10:07:00 +00:00
Jeffrey Zhang	21f96caab3	Fix wrong syntax for alert field templates (#3883 )	2018-02-24 09:37:43 +00:00
Pedro Araújo	575f665944	Add OS type meta label to Azure SD (#3863 ) There is currently no way to differentiate Windows instances from Linux ones. This is needed when you have a mix of node_exporters / wmi_exporters for OS-level metrics and you want to have them in separate scrape jobs. This change allows you to do just that. Example: ``` - job_name: 'node' azure_sd_configs: - <azure_sd_config> relabel_configs: - source_labels: [__meta_azure_machine_os_type] regex: Linux action: keep ``` The way the vendor'd AzureSDK provides to get the OsType is a bit awkward - as far as I can tell, this information can only be gotten from the startup disk. Newer versions of the SDK appear to improve this a bit (by having OS information in the InstanceView), but the current way still works.	2018-02-19 15:40:57 +00:00
Andrea Giardini	3a9637fa3c	docs: Fix remote_read/remote_timeout default (#3829 )	2018-02-12 12:52:33 +00:00
zemek	8a01a0fbed	Set consul server default to localhost:8500 (#3703 )	2018-01-24 12:14:32 +00:00
James Turnbull	00f4821178	Added missing ingress from role list (#3666 )	2018-01-08 21:23:01 +00:00
Brian Brazil	fba80da635	Fix default of read_recent to be false. (#3617 ) This is what is documented in the migration guide, and the default settings should make sense for a true long term storage. Document the setting.	2017-12-23 17:21:38 +00:00

1 2 3 4

170 commits