node_exporter

mirror of https://github.com/prometheus/node_exporter.git synced 2025-08-20 18:33:52 -07:00

Author	SHA1	Message	Date
Ryan J. Geyer	5e552bac02	Replace mistaken ) with }, resulting in parsable promql Signed-off-by: Ryan J. Geyer <me@ryangeyer.com>	2022-12-13 13:30:42 +01:00
Jan Fajerski	87b8e3790d	docs/node-mixin: add fsMointpointSelector to alerts and dashboards (#2446 ) * docs/node-mixin: add fsMountpointSelector This adds the option to add a `mountpoint` selector to filesystem related alerts. The default is `mountpoint!=""`. * docs/node-mixins: add fsMountpointSelector to dashboards Signed-off-by: Jan Fajerski <jfajersk@redhat.com>	2022-10-20 13:06:31 +02:00
Gabriel Amaral Antunes	410e069471	Add darwin dashboard to mixin Signed-off-by: Vitaly Zhuravlev <zhuravlev.vitaly@gmail.com>	2022-04-20 15:18:43 +04:00
Severyn Lisovskyi	7b86b7cb29	[node-mixin] change current datasource to grafana's default Signed-off-by: Severyn Lisovskyi <993215+sev3ryn@users.noreply.github.com>	2022-02-02 14:45:26 +01:00
Julian Wiedmann	3e6f4ce627	mixin: exclude iowait and steal from CPU Utilisation (#2194 ) 'iowait' and 'steal' indicate specific idle/wait states, which shouldn't be counted into CPU Utilisation. Also see https://github.com/prometheus-operator/kube-prometheus/pull/796 and https://github.com/kubernetes-monitoring/kubernetes-mixin/pull/667. Per the iostat man page: %idle Show the percentage of time that the CPU or CPUs were idle and the system did not have an outstanding disk I/O request. %iowait Show the percentage of time that the CPU or CPUs were idle during which the system had an outstanding disk I/O request. %steal Show the percentage of time spent in involuntary wait by the virtual CPU or CPUs while the hypervisor was servicing another virtual processor. Signed-off-by: Julian Wiedmann <jwi@linux.ibm.com>	2021-11-04 11:03:27 +01:00
Tom Wilkie	9bc184d236	Datasource template variable should be labelled 'Data Source' Signed-off-by: Tom Wilkie <tom@grafana.com>	2021-10-20 17:10:14 +01:00
Johannes 'fish' Ziemke	7fc5c6045a	Read config from $ Signed-off-by: Johannes 'fish' Ziemke <github@freigeist.org>	2021-07-27 16:32:05 +02:00
ArthurSens	3731f93fd7	Refactor USE method mixin dashboards with grafonnet-lib, add multi-cluster support. Aiming for cleaner code and following standards used on younger mixins. Signed-off-by: ArthurSens <arthursens2005@gmail.com>	2021-07-27 16:32:05 +02:00
raviprasad_lr	504f9b785c	fix interval in graphs panels of node dashboard Signed-off-by: raviprasad_lr <raviprasad_lr@yahoo.com>	2021-04-26 11:14:30 +02:00
Johannes 'fish' Ziemke	a5908bf82b	Make interval configurable Signed-off-by: Johannes 'fish' Ziemke <github@freigeist.org>	2021-04-07 09:37:04 +02:00
Johannes 'fish' Ziemke	772335caa8	Use 5m rate in mixins The default scrape interval of Prometheus is 60s, so we can't use a 1m rate. Signed-off-by: Johannes 'fish' Ziemke <github@freigeist.org>	2021-04-07 09:37:04 +02:00
Matthias Loibl	77e76485c0	Use absolute jsonnet import paths This should be the way forward when importing libraries in jsonnet. It's closer to how Go imports look and makes it more obvious where packages live. This is not breaking anything, as the old imports were already symlinks to the now directly used directories. Signed-off-by: Matthias Loibl <mail@matthiasloibl.com>	2020-10-20 11:34:43 +02:00
Arthur Outhenin-Chalandre	6585e43eec	Fix memory gauge in mixin with multiple pods Signed-off-by: Arthur Outhenin-Chalandre <arthur@cri.epita.fr>	2020-09-23 15:36:43 +02:00
paulfantom	40570924b1	docs/node-mixin/dashboards: do not mix tabs and spaces Signed-off-by: paulfantom <pawel@krupa.net.pl>	2019-11-01 15:46:21 +01:00
beorn7	c6914477f5	Fix the normalization for the cluster-wide dashboards We actually have to count or sum, respectively, _all_ the selected metrics for the cluster-wide view. Which means it's easiest to use the `scalar` approach after all (but only in the cluster dashboard). This still propagates all the labels. I have extended the comment for the `nodeExporterSelector` to note that the cluster dashboard only makes sense if all the selected node exporter actually belong to the same cluster. Since this is jsonnet, users can easily disable the cluster dashboard. Or even create multiple instances of the dashboards with different `nodeExporterSelector`s for different clusters. Signed-off-by: beorn7 <beorn@grafana.com>	2019-10-30 22:52:36 +01:00
Benoît Knecht	5a7b85876d	docs/node-mixin: Improve memory pressure rule The `instance:node_memory_swap_io_pages:rate1m` rule was intended to measure the amount of memory pressure a system is under, but its name is a bit misleading (it specifically refers to swap), and the rate of `node_vmstat_pgmajfault` is a better metric for memory pressure (see #1524). This commit renames `instance:node_memory_swap_io_pages:rate1m` to `instance:node_vmstat_pgmajfault:rate1m`, and defines it as `rate(node_vmstat_pgmajfault{%(nodeExporterSelector)s}[1m])`. The dashboards are updated accordingly. Signed-off-by: Benoît Knecht <benoit.knecht@fsfe.org>	2019-10-28 15:12:42 +01:00
Sergiusz Urbaniak	ed78237036	node-mixin: fix query in Disk Space Utilisation dashboard Signed-off-by: Sergiusz Urbaniak <sergiusz.urbaniak@gmail.com>	2019-09-12 14:02:56 +02:00
beorn7	76ff263ca6	Update legendLink This still had the 'k8s' in as it was copied and pasted from the kubernetes-mixin. Signed-off-by: beorn7 <beorn@grafana.com>	2019-08-20 18:49:12 +02:00
beorn7	44e5731de7	Add line for number of cores to load graph Backported from the node dashboard in the kubernetes-mixin. Signed-off-by: beorn7 <beorn@grafana.com>	2019-08-15 16:43:57 +02:00
beorn7	024d5ed55e	Fix title of CPU panel to usage We use the `mode="idle"` metric, but we are inverting it, so this is usage, and that's intended. Signed-off-by: beorn7 <beorn@grafana.com>	2019-08-15 16:36:10 +02:00
beorn7	a016d9cd6f	node-mixin: Improve disk usage panel - Use a stacked graph instead of a gauge as development over time is especially useful for disk space usage. - By only taking one metric per device into account, we avoid double-counting for devices that are mounted multiple times. Signed-off-by: beorn7 <beorn@grafana.com>	2019-08-15 16:32:54 +02:00
Björn Rabenstein	7ef6f2576d	node-mxin: Improve nodes dashboard (#1448 ) * node-mixin: Improve nodes dashboard - Use stacking where it makes sense. - Normalize idle CPU so that stacking is more meaningful. - Consistently fill where stacking is used but don't fill where not. - Fix y axis max value for Idle CPU panel. - Fix y axis min value for memory usage panel. - Use `$__interval` for range where applicable (and set min step to 1m). - Make the right Y axis for disk I/O actually work. This is just an incremental improvements. It doesn't touch the more involved TODOs. Signed-off-by: beorn7 <beorn@grafana.com>	2019-08-15 00:40:51 +02:00
beorn7	f350aaf87e	node-mixin: Fix various straight-forward issues in the USE dashboards - Normalize cluster memory utilisation. - Fix missing `1m` in memory saturation. - Have both disk-related row next to each other instead with the network row in between. - Correctly render transmit network traffic as negative, using `seriesOverrides` and `min: null` for the y-axis. - Make panel and row naming consistent. - Remove legend where it would just display a single entry with exactly the title of the panel. - Fix metric name in individual node CPU Saturation panel. - Break up disk space utilisation by device in the panel for an individual node. NB: All of that doesn't touch any more subtle issues captured in the various TODOs. Signed-off-by: beorn7 <beorn@grafana.com>	2019-08-13 21:54:28 +02:00
paulfantom	c41826274d	docs/node-mixin: move fsSelector and diskDeviceSelector to the end of query This will cause a query to be valid even if values of selector are empty. Additionally fixing query responsible for disk space usage. Signed-off-by: paulfantom <pawel@krupa.net.pl>	2019-07-24 13:05:02 +02:00
beorn7	79f0357e38	Added `_excluding_lo` to name of network rules that exclude lo Signed-off-by: beorn7 <beorn@grafana.com>	2019-07-22 20:21:52 +02:00
beorn7	36dc7451c9	Improvement of comments and panel titles Signed-off-by: beorn7 <beorn@grafana.com>	2019-07-22 14:06:27 +02:00
beorn7	e01d9f9e78	Break out device in disk IO rules/dashboard Signed-off-by: beorn7 <beorn@grafana.com>	2019-07-18 15:59:35 +02:00
beorn7	b8c4b0cb29	Removed unneeded `sum_` and `avg_` from rule names Signed-off-by: beorn7 <beorn@grafana.com>	2019-07-18 14:14:02 +02:00
beorn7	706511a495	Responses to review comments, round 3 Signed-off-by: beorn7 <beorn@grafana.com>	2019-07-17 23:54:31 +02:00
beorn7	a92d1d7889	Address review comments, batch 2 Signed-off-by: beorn7 <beorn@grafana.com>	2019-07-16 21:18:17 +02:00
beorn7	3ab1f41d12	Make more use of config.libsonnet Signed-off-by: beorn7 <beorn@grafana.com>	2019-07-16 19:34:27 +02:00
beorn7	2180c2f3bf	Address first batch of old review comments Signed-off-by: beorn7 <beorn@grafana.com>	2019-07-16 19:14:17 +02:00
beorn7	dec5b5b053	Fix indentation Signed-off-by: beorn7 <beorn@grafana.com>	2019-07-10 20:07:20 +02:00
beorn7	9d7045e483	(Re-)adjust to Grafana gauge expecting percentage 0-100 (rather than 1-0) Signed-off-by: beorn7 <beorn@grafana.com>	2019-07-10 19:40:04 +02:00
beorn7	f331b308f3	Use promgrafonnet as a vendored library from its source The only deviation that happened so far is to use format="percentunit" in a Grafana gauge. This change wasn't even properly used in this repo so far, so I opted to stick with "upstream" for now. If changes are really needed, we can try to change upstream first. Another change was done in parallal here and upstream, but it was "more correct" in upstream. (Change datasource to $datasource variable, only partially applied here.) Which is another point for using the upstream and not copy it here. Signed-off-by: beorn7 <beorn@grafana.com>	2019-07-06 21:11:23 +02:00
beorn7	2df034c055	Move node-mixin into docs directory Signed-off-by: beorn7 <beorn@grafana.com>	2019-07-05 19:38:03 +02:00

36 commits