node_exporter

mirror of https://github.com/prometheus/node_exporter.git synced 2025-08-20 18:33:52 -07:00

Author	SHA1	Message	Date
Ben Kochie	1824ac3b9e	Fix smartmon.sh textfile script (#700 ) When there are no SMART compatible devices (Raspberry Pi for example) an error is returned, but the return code is still 0. `# scan_smart_devices: glob(3) aborted matching pattern /dev/discs/disc` Remove unused `disks` variable. * Filter for only valid `/dev` devices.	2017-10-18 07:37:47 +02:00
Siavash Safi	f3a7022602	Add `collect[]` parameter (#699 ) * Add `collect[]` parameter * Add TODo comment about staticcheck ignored * Restore promhttp.HandlerOpts * Log a warning and return HTTP error instead of failing * Check collector existence and status, cleanups * Fix warnings and error messages * Don't panic, return error if collector registration failed * Update README	2017-10-14 14:23:42 +02:00
Ben Kochie	8f9edf87b5	Add extra notes to Building section (#694 ) * Add link to Golang * Add note about RHEL/CentOS build dep.	2017-10-11 11:46:13 +02:00
Wei Wei	1e4af21256	add rslave for docker example, so node_exporter can receive host mount/unmount events (#660 )	2017-10-11 11:18:30 +02:00
Ben Kochie	6e2053c557	Fix circle docker test tag name. (#688 ) The default DOCKER_IMAGE_TAG setup fails when running in circle, override with the CIRCLE_TAG.	2017-10-06 12:33:03 +02:00
Ben Kochie	f84dd15be7	Release v0.15.0 (#686 ) * Release v0.15.0 * Bump version. * Update CHANGELOG. * Update to Go 1.9 in circle.yml	2017-10-06 09:43:58 +02:00
Ben Kochie	deadfef4c9	Update vendoring (#685 ) * Update vendor github.com/coreos/go-systemd/dbus@v15 * Update vendor github.com/ema/qdisc * Update vendor github.com/godbus/dbus * Update vendor github.com/golang/protobuf/proto * Update vendor github.com/lufia/iostat * Update vendor github.com/matttproud/golang_protobuf_extensions/pbutil@v1.0.0 * Update vendor github.com/prometheus/client_golang/... * Update vendor github.com/prometheus/common/... * Update vendor github.com/prometheus/procfs/... * Update vendor github.com/sirupsen/logrus@v1.0.3 Adds vendor golang.org/x/crypto * Update vendor golang.org/x/net/... * Update vendor golang.org/x/sys/... * Update end to end output.	2017-10-05 16:20:47 +02:00
Tobias Schmidt	ba96b6561b	Merge pull request #682 from derekmarcotte/dm-386-native Only enable race detector when GOHOSTARCH is amd64.	2017-10-05 09:07:52 +02:00
Ben Kochie	a47f033f1b	Add text file helper for apt-get. (#680 ) * Add metric for pending upgrades. * Add metric for pending reboot required.	2017-10-04 08:34:30 +02:00
Brett Vickers	b62c7bc0ad	Updated vendored ntp package (#681 ) The github.com/beevik/ntp package was recently updated with some API changes that broke node_exporter. This commit fetches the latest version of the ntp package and brings node_exporter in line with the latest API.	2017-10-04 08:33:49 +02:00
Derek Marcotte	a6b8922a01	Only enable race detector when GOHOSTARCH is amd64. This enables native builds to still run the test and all targets without problems on say 386. Build failure on Buildkite build 85, prevents enabling native FreeBSD 386 builds.	2017-10-03 16:40:22 -04:00
Calle Pettersson	859a825bb8	Replace --collectors.enabled with per-collector flags (#640 ) * Move NodeCollector into package collector * Refactor collector enabling * Update README with new collector enabled flags * Fix out-of-date inline flag reference syntax * Use new flags in end-to-end tests * Add flag to disable all default collectors * Track if a flag has been set explicitly * Add --collectors.disable-defaults to README * Revert disable-defaults flag * Shorten flags * Fixup timex collector registration * Fix end-to-end tests * Change procfs and sysfs path flags * Fix review comments	2017-09-28 15:06:26 +02:00
Sami Kerola	3762191e66	Add timex collector (#664 ) This collector is based on adjtimex(2) system call. The collector returns three values, status if time is synchronised, offset to remote reference, and local clock frequency adjustment. Values are taken from kernel time keeping data structures to avoid getting involved how the synchronisation is implemented. By that I mean one should not care if time is update using ntpd, systemd.timesyncd, ptpd, and so on. Since all time sync implementation will always end up telling to kernel what is the status with time one can simply omit the software in between, and look results of the syncing. As a positive side effect this makes collector very quick and conceptually specific, this does not monitor availability of NTP server, or network in between, or dns resolution, and other unrelated but necessary things. Minimum set of values to keep eye on are the following three: The node_timex_sync_status tells if local clock is in sync with a remote clock. Value is set to zero when synchronisation to a reliable server is lost, or a time sync software is misconfigured. The node_timex_offset_seconds tells how much local clock is off when compared to reference. In case of multiple time references this value is outcome of RFC 5905 adjustment algorithm. Ideally offset should be close to zero, and it depends about use case how large value is acceptable. For example a typical web server is probably fine if offset is about 0.1 or less, but that would not be good enough for mobile phone base station operator. The node_timex_freq tells amount of adjustment to local clock tick frequency. For example if offset is one second and growing the local clock will need instruction to tick quicker. Number value itself is not very important, and occasional small adjustments are fine. When frequency is unusually in stable one can assume quality of time stamps will not be accurate to very far in sub second range. Obviously explaining why local clock frequency behaves like a passenger in roller coaster is different matter. Explanations can vary from system load, to environmental issues such as a machine being physically too hot. Rest of the measurements can help when debugging. If you run a clock server do probably want to collect and keep track of everything. Pull-request: https://github.com/prometheus/node_exporter/pull/664	2017-09-19 07:54:06 -07:00
Leonid Evdokimov	c169b4b1c5	Add metrics from SNTPv4 packet to ntp collector & add ntpd sanity check (#655 ) * Add metrics from SNTPv4 packet to ntp collector & add ntpd sanity check 1. Checking local clock against remote NTP daemon is bad idea, local ntpd acting as a client should do it better and avoid excessive load on remote NTP server so the collector is refactored to query local NTP server. 2. Checking local clock against remote one does not check local ntpd itself. Local ntpd may be down or out of sync due to network issues, but clock will be OK. 3. Checking NTP server using sanity of it's response is tricky and depends on ntpd implementation, that's why common `node_ntp_sanity` variable is exported. * `govendor add golang.org/x/net/ipv4`, it is dependency of github.com/beevik/ntp * Update github.com/beevik/ntp to include boring SNTP fix * Use variable name from RFC5905 * ntp: move code to make export of raw metrics more explicit * Move NTP math to `github.com/beevik/ntp` * Make `golint` happy * Add some brief docs explaining `ntp` #655 and `timex` #664 modules * ntp: drop XXX comment that got its decision * ntp: add `_seconds` suffix to relevant metrics * Better `node_ntp_leap` comment * s/node_ntp_reftime/node_ntp_reference_timestamp_seconds/ as requested by @discordianfish * Extract subsystem name to const as suggested by @SuperQ	2017-09-19 10:36:14 +02:00
Karsten Weiss	b0d5c00832	cpu: Metric 'package_throttles_total' is per package. (#657 ) * cpu: Metric 'package_throttles_total' is per package. 'package_throttles_total' is per package, not per cpu. This also reduces the total number of cpu time series a lot (esp for multi core cpus). * cpu: Better handling of a cpulist edge-case. * cpu: Extract the package number from the directory name. Do not rely on the range index. * cpu: Add package_throttle_count for node0 cpu1 This file must be ignored by the cpu collector.	2017-09-07 23:24:18 +02:00
Alexey Palazhchenko	abb58a31e2	Test with Go 1.9.x (#667 )	2017-08-31 18:00:55 +02:00
Matt Bostock	89a2f21f45	Always try to return smartmon_device_info metric (#663 ) * Always try to return smartmon_device_info metric Sometimes the 'model family' field is not returned by `smartctl' because a disk is not in the disk database for the version of smartmontools installed on the system. In those cases, the device model and serial number is still returned (at least as far as I have observed. Re-work the logic to prefer the 'vendor' field first, and if not present, always output a `smartmon_device_info` metric even if some labels have empty values. On the box I'm testing this on, where previously no metric was returned, it now returns: # HELP smartmon_device_info SMART metric device_info # TYPE smartmon_device_info gauge smartmon_device_info{disk="/dev/sda",type="sat",model_family="",device_model="INTEL REDACTED",serial_number="REDACTED",firmware_version="REDACTED"} 1 smartmon_device_info{disk="/dev/sdb",type="sat",model_family="",device_model="INTEL REDACTED",serial_number="REDACTED",firmware_version="REDACTED"} 1 smartmon_device_info{disk="/dev/sdc",type="sat",model_family="",device_model="INTEL REDACTED",serial_number="REDACTED",firmware_version="REDACTED"} 1 smartmon_device_info{disk="/dev/sdd",type="sat",model_family="",device_model="INTEL REDACTED",serial_number="REDACTED",firmware_version="REDACTED"} 1 smartmon_device_info{disk="/dev/sde",type="sat",model_family="",device_model="INTEL REDACTED",serial_number="REDACTED",firmware_version="REDACTED"} 1 smartmon_device_info{disk="/dev/sdf",type="sat",model_family="",device_model="INTEL REDACTED",serial_number="REDACTED",firmware_version="REDACTED"} 1 * Add trailing newline Because POSIX: https://stackoverflow.com/a/729795	2017-08-31 18:00:42 +02:00
Tobias Schmidt	f9a2388c60	Merge pull request #662 from prometheus/bjk/buildkite Add buildkite status badge.	2017-08-24 12:59:18 +02:00
Ben Kochie	9947f602f3	Add buildkite status badge.	2017-08-24 12:29:34 +02:00
Matthias Rampke	d3e3a9c181	Only cross-test 32bit on Linux (#658 ) This doesn't work on at least FreeBSD and Darwin. It does work on Linux, only try it there.	2017-08-24 09:13:17 +02:00
Christian Will	2ed98fd5a5	define binary name in promu configuration file (#650 )	2017-08-22 17:24:07 +02:00
Tobias Schmidt	505275b48c	Merge pull request #652 from prometheus/mr/test-32 Automatically cross-test 32bit based on GOARCH	2017-08-22 00:10:04 +02:00
Tobias Schmidt	ba6897583b	Merge pull request #653 from prometheus/mr/fix-629 Use int64 throughout the ZFS collector.	2017-08-21 22:28:37 +02:00
Matthias Rampke	7420046383	Automatically cross-test 32bit based on GOARCH Try to determine the corresponding 32bit architecture from the current GOARCH and run the tests under that architecture. This only works on a GOOS/GOARCH that can execute binaries for the smaller architecture, such as running linux/386 binaries under linux/amd64. I tested that this works under linux/amd64 and darwin/amd64, the rest of the architectures is guesswork. While we still only run regular tests on Intel/Linux architectures, this covers general integer overflow issues like #629.	2017-08-21 17:27:25 +00:00
Matthias Rampke	5aa6819eb1	gofmt node_exporter_test	2017-08-21 16:45:42 +00:00
Matthias Rampke	e1f129c729	Use int64 throughout the ZFS collector. This avoids issues with integer overflows on 32-bit architectures. The Prometheus data format is float64, so regardless of the architecture we should handle large numbers. Fixes #629.	2017-08-21 16:40:16 +00:00
Matthias Rampke	8661bbbb42	Merge pull request #651 from TheTincho/fix_integration_test_timing Fix path and timing issues with integration tests.	2017-08-19 15:12:42 +02:00
Martín Ferrari	2cd49eb020	Fix path and timing issues with integration tests.	2017-08-19 11:37:57 +02:00
Ben Kochie	8839640cd1	Ignore wifi collector permission errors (#646 ) Ignore the permission denined error when the wifi collector has no permission to read metrics.	2017-08-18 10:19:48 +02:00
Ben Kochie	b7cc6fbea7	Add additional field to github issue template. (#645 ) * Add additional field to github issue template. Request the command line flags to the exporter. * Update version flag for kingpin.	2017-08-17 12:44:26 +02:00
Hemant Kumar	de08e38c5e	Add dockerfile for ppc64le (#638 ) * Add dockerfile for ppc64le and related changes * Pass the fill file as DOCKEFILE * Add the dockerfile name to build msg	2017-08-17 11:53:04 +02:00
Joe Handzik	4b011bfe44	Clarify Infiniband collector support (#643 ) Tested a DL360 Gen9 box with an Omni-Path adapter in it. The existing InfiniBand collector can provide support for the same metrics on Omni-Path cards as well. Signed-Off-By: Joe Handzik <joseph.t.handzik@hpe.com>	2017-08-16 07:32:54 +02:00
Calle Pettersson	dfe07eaae8	Switch to kingpin flags (#639 ) * Switch to kingpin flags * Fix logrus vendoring * Fix flags in main tests * Fix vendoring versions	2017-08-12 15:07:24 +02:00
Vojtech Galda	1467d845fb	Status information in /proc/drbd (#630 ) in version 8.4 deprecated (but won’t be removed)	2017-08-02 08:04:13 +02:00
Matthias Rampke	6506513be5	Merge pull request #626 from teohhanhui/patch-1 Fix Docker mountpoint prefix docs	2017-07-28 09:32:19 +02:00
Teoh Han Hui	0b1f64bb15	Fix Docker mountpoint prefix docs	2017-07-28 15:06:28 +08:00
Ben Kochie	46c31d8a7e	Enable IPVS collector by default (#623 ) * Silence error output when no IPVS present. * Enable by default. * Update end-to-end fixture. * Update README.	2017-07-26 15:20:28 +02:00
Tobias Schmidt	efe5f62717	Merge pull request #620 from prometheus/grobie/fix-meminfo-collector Restrict build tags of collectors to supported operating systems	2017-07-20 15:25:47 -04:00
Tobias Schmidt	515b5a933d	Fix build tags of loadavg collector The collector is only implemented for a subset of all operating systems supported by go. Compilation will fail if attempted for another OS target.	2017-07-20 15:13:58 -04:00
Tobias Schmidt	016d79535d	Fix build tags of meminfo collector The meminfo collector only supports darwin, dragonfly, freebsd and linux and must not be included in other archtictures.	2017-07-20 14:37:10 -04:00
Tobias Schmidt	efc1ea14ba	Ignore extracted sysfs fixture files from git	2017-07-20 14:36:48 -04:00
Andrea De Pasquale	1369763067	Change raid0 status line regexp for mdadm collector (#619 )	2017-07-20 17:04:33 +02:00
Ben Kochie	971de21945	Minor tweak to GitHub issue template.	2017-07-20 10:57:07 +02:00
Tobias Schmidt	921319c7eb	Merge pull request #583 from knweiss/golint Golint fixes	2017-07-10 23:49:36 +02:00
Aleksey Zhukov	7a914e58f2	Add parsing /proc/net/snmp6 file for netstat-linux (#615 ) * Add parsing /proc/net/snmp6 file * add /proc/net/snmp6 fixture * fix e2e test * gofmt * remove unuser variable * safe checks * add tests * change help format	2017-07-08 20:16:35 +02:00
Jerome Froelich	cb14fff6c6	[test] Call cmd.Start and cmd.Wait separately to avoid triggering race detector (#616 ) * [test] Call cmd.Start and cmd.Wait separately to avoid triggering race detector * [test] Enable race detector for tests	2017-07-08 20:15:40 +02:00
Matt Layher	6e82fd1c56	Add XFS block mapping and block map B-tree stats (#575 )	2017-07-07 07:27:52 +02:00
fahlke	a89d72b5eb	Resolves prometheus/node_exporter#585 (#586 ) * Resolves prometheus/node_exporter#585 * - removed 'docker rm' as it is not allowed on CircleCI See discussion: https://discuss.circleci.com/t/docker-error-removing-intermediate-container/70	2017-07-07 07:26:11 +02:00
ideaship	8d90276283	Add bcache collector (#597 ) * Add bcache collector for Linux This collector gathers metrics related to the Linux block cache (bcache) from sysfs. * Removed commented out code * Use project comment style * Add _sectors to metric name to indicate unit * Really use project comment style * Rename bcache.go to bcache_linux.go * Keep collector namespace clean Rename: - metric -> bcacheMetric - periodStatsToMetrics -> bcachePeriodStatsToMetric * Shorten slice initialization * Change label names to backing_device, cache_device * Remove five minute metrics (keep only total) * Include units in additional metric names * Enable bcache collector by default * Provide metrics in seconds, not nanoseconds * remove metrics with label "all" * Add fixtures, update end-to-end for bcache collector * Move fixtures/sys into tar.gz This changeset moves the collector/fixtures/sys directory into collector/fixtures/sys.tar.gz and tweaks the Makefile to unpack the tarball before tests are run. The reason for this change is that Windows does not allow colons in a path (colons are present in some of the bcache fixture files), nor can it (out of the box) deal with pathnames longer than 260 characters (which we would be increasingly likely to hit if we tried to replace colons with longer codes that are guaranteed not the turn up in regular file names). * Add ttar: plain text archive, replacement for tar This changeset adds ttar, a plain text replacement for tar, and uses it for the sysfs fixture archive. The syntax is loosely based on tar(1). Using a plain text archive makes it possible to review changes without downloading and extracting the archive. Also, when working on the repo, git diff and git log become useful again, allowing a committer to verify and track changes over time. The code is written in bash, because bash is available out of the box on all major flavors of Linux and on macOS. The feature set used is restricted to bash version 3.2 because that is what Apple is still shipping. The programm also works on Windows if bash is installed. Obviously, it does not solve the Windows limitations (path length limited to 260 characters, no symbolic links) that prompted the move to an archive format in the first place.	2017-07-07 07:20:18 +02:00
Alexey Palazhchenko	bba075710d	Set Go import path on Travis CI (#612 )	2017-07-06 14:12:22 +02:00

1 2 3 4 5 ...

881 commits