prometheus

mirror of https://github.com/prometheus/prometheus.git synced 2024-11-15 01:54:06 -08:00

Author	SHA1	Message	Date
Julius Volz	01f652cb4c	Separate storage implementation from interfaces. This was initially motivated by wanting to distribute the rule checker tool under `tools/rule_checker`. However, this was not possible without also distributing the LevelDB dynamic libraries because the tool transitively depended on Levigo: rule checker -> query layer -> tiered storage layer -> leveldb This change separates external storage interfaces from the implementation (tiered storage, leveldb storage, memory storage) by putting them into separate packages: - storage/metric: public, implementation-agnostic interfaces - storage/metric/tiered: tiered storage implementation, including memory and LevelDB storage. I initially also considered splitting up the implementation into separate packages for tiered storage, memory storage, and LevelDB storage, but these are currently so intertwined that it would be another major project in itself. The query layers and most other parts of Prometheus now have notion of the storage implementation anymore and just use whatever implementation they get passed in via interfaces. The rule_checker is now a static binary :) Change-Id: I793bbf631a8648ca31790e7e772ecf9c2b92f7a0	2014-04-16 13:30:19 +02:00
Julius Volz	c7c0b33d0b	Add regex-matching support for labels. There are four label-matching ops for selecting timeseries now: - Equal: = - NotEqual: != - RegexMatch: =~ - RegexNoMatch: !~ Instead of looking up labels by a simple clientmodel.LabelSet (basically an equals op for every key/value pair in the set), timeseries fingerprint selection is now done via a list of metric.LabelMatchers. Change-Id: I510a83f761198e80946146770ebb64e4abc3bb96	2014-04-01 14:24:53 +02:00
Julius Volz	ae30453214	Add label names -> label values index. Change-Id: Ie39b4044558afc4d1aa937de7dcf8df61f821fb4	2014-03-28 15:16:37 +01:00
Julius Volz	c6013ff309	Remove unused labelname -> fingerprints index. Change-Id: Ie4ccea3a230532e670030ca64ede9435b1b3e506	2014-03-05 23:49:33 +01:00
Bjoern Rabenstein	6bc083f38b	Major code cleanup in storage. - Mostly docstring fixed/additions. (Please review these carefully, since most of them were missing, I had to guess them from an outsider's perspective. (Which on the other hand proves how desperately required many of these docstrings are.)) - Removed all uses of new(...) to meet our own style guide (draft). - Fixed all other 'go vet' and 'golint' issues (except those that are not fixable (i.e. caused by bugs in or by design of 'go vet' and 'golint')). - Some trivial refactorings, like reorder functions, minor renames, ... - Some slightly less trivial refactoring, mostly to reduce code duplication by embedding types instead of writing many explicit forwarders. - Cleaned up the interface structure a bit. (Most significant probably the removal of the View-like methods from MetricPersistenc. Now they are only in View and not duplicated anymore.) - Removed dead code. (Probably not all of it, but it's a first step...) - Fixed a leftover in storage/metric/end_to_end_test.go (that made some parts of the code never execute (incidentally, those parts were broken (and I fixed them, too))). Change-Id: Ibcac069940d118a88f783314f5b4595dce6641d5	2014-02-27 15:22:37 +01:00
Julius Volz	740d448983	Use custom timestamp type for sample timestamps and related code. So far we've been using Go's native time.Time for anything related to sample timestamps. Since the range of time.Time is much bigger than what we need, this has created two problems: - there could be time.Time values which were out of the range/precision of the time type that we persist to disk, therefore causing incorrectly ordered keys. One bug caused by this was: https://github.com/prometheus/prometheus/issues/367 It would be good to use a timestamp type that's more closely aligned with what the underlying storage supports. - sizeof(time.Time) is 192, while Prometheus should be ok with a single 64-bit Unix timestamp (possibly even a 32-bit one). Since we store samples in large numbers, this seriously affects memory usage. Furthermore, copying/working with the data will be faster if it's smaller. MEMORY USAGE RESULTS Initial memory usage comparisons for a running Prometheus with 1 timeseries and 100,000 samples show roughly a 13% decrease in total (VIRT) memory usage. In my tests, this advantage for some reason decreased a bit the more samples the timeseries had (to 5-7% for millions of samples). This I can't fully explain, but perhaps garbage collection issues were involved. WHEN TO USE THE NEW TIMESTAMP TYPE The new clientmodel.Timestamp type should be used whenever time calculations are either directly or indirectly related to sample timestamps. For example: - the timestamp of a sample itself - all kinds of watermarks - anything that may become or is compared to a sample timestamp (like the timestamp passed into Target.Scrape()). When to still use time.Time: - for measuring durations/times not related to sample timestamps, like duration telemetry exporting, timers that indicate how frequently to execute some action, etc. NOTE ON OPERATOR OPTIMIZATION TESTS We don't use operator optimization code anymore, but it still lives in the code as dead code. It still has tests, but I couldn't get all of them to pass with the new timestamp format. I commented out the failing cases for now, but we should probably remove the dead code soon. I just didn't want to do that in the same change as this. Change-Id: I821787414b0debe85c9fffaeb57abd453727af0f	2013-12-03 09:11:28 +01:00
Matt T. Proud	c262907fec	Kill interface cruft. These pieces were never used and should be thusly removed. Change-Id: I8dd151ec4c40b6d3ccffad1bb9b8b75a92e9ee37	2013-08-15 11:39:07 +02:00
Matt T. Proud	b23acccea8	Kill AppendSample interface definition. AppendSample will be repcated with AppendSamples, which will take advantage of bulks appends. This is a necessary step for indexing pipeline decoupling. Change-Id: Ia83811a87bcc89973d3b64d64b85a28710253ebc	2013-08-15 11:35:50 +02:00
Matt T. Proud	30b1cf80b5	WIP - Snapshot of Moving to Client Model.	2013-06-25 15:52:42 +02:00
Julius Volz	750f862d9a	Use GetBoundaryValues() for non-counter deltas.	2013-05-22 19:13:47 +02:00
Matt T. Proud	c07abf8521	Initial move away from skiplist.	2013-05-22 17:59:53 +02:00
Julius Volz	5b105c77fc	Repointerize fingerprints.	2013-05-21 14:28:14 +02:00
Matt T. Proud	e5ac91222b	Benchmark memory arena; simplify map generation. The one-off keys have been replaced with ``model.LabelPair``, which is indexable. The performance impact is negligible, but it represents a cognitive simplification.	2013-05-21 09:39:12 +02:00
Julius Volz	83c60ad43a	Fix GetMetricForFingerprint() metric mutability. Some users of GetMetricForFingerprint() end up modifying the returned metric labelset. Since the memory storage's implementation of GetMetricForFingerprint() returned a pointer to the metric (and maps are reference types anyways), the external mutation propagated back into the memory storage. The fix is to make a copy of the metric before returning it.	2013-05-14 16:46:30 +02:00
Julius Volz	ce1ee444f1	Synchronous memory appends and more fine-grained storage locks. This does two things: 1) Make TieredStorage.AppendSamples() write directly to memory instead of buffering to a channel first. This is needed in cases where a rule might immediately need the data generated by a previous rule. 2) Replace the single storage mutex by two new ones: - memoryMutex - needs to be locked at any time that two concurrent goroutines could be accessing (via read or write) the TieredStorage memoryArena. - memoryDeleteMutex - used to prevent any deletion of samples from memoryArena as long as renderView is running and assembling data from it. The LevelDB disk storage does not need to be protected by a mutex when rendering a view since renderView works off a LevelDB snapshot. The rationale against adding memoryMutex directly to the memory storage: taking a mutex does come with a small inherent time cost, and taking it is only required in few places. In fact, no locking is required for the memory storage instance which is part of a view (and not the TieredStorage).	2013-05-10 17:15:52 +02:00
Julius Volz	99dcbe0f94	Integrate memory and disk layers in view rendering.	2013-04-19 16:01:27 +02:00
Julius Volz	63625bd244	Make view use memory persistence, remove obsolete code. This makes the memory persistence the backing store for views and adjusts the MetricPersistence interface accordingly. It also removes unused Get* method implementations from the LevelDB persistence so they don't need to be adapted to the new interface. In the future, we should rethink these interfaces. All staleness and interpolation handling is now removed from the storage layer and will be handled only by the query layer in the future.	2013-04-18 22:26:29 +02:00
Matt T. Proud	a55602df4a	Validate diskFrontier domain for series candidate. It is the case with the benchmark tool that we thought that we generated multiple series and saved them to the disk as such, when in reality, we overwrote the fields of the outgoing metrics via Go map reference behavior. This was accidental. In the course of diagnosing this, a few errors were found: 1. ``newSeriesFrontier`` should check to see if the candidate fingerprint is within the given domain of the ``diskFrontier``. If not, as the contract in the docstring stipulates, a ``nil`` ``seriesFrontier`` should be emitted. 2. In the interests of aiding debugging, the raw LevelDB ``levigoIterator`` type now includes a helpful forensics ``String()`` method. This work produced additional cleanups: 1. ``Close() error`` with the storage stack is technically incorrect, since nowhere in the bowels of it does an error actually occur. The interface has been simplified to remove this for now.	2013-04-09 11:47:16 +02:00
Julius Volz	e096896932	PR comment fixups.	2013-03-26 15:28:00 +01:00
Julius Volz	dd67ab115b	Change GetAllMetricNames() to GetAllValuesForLabel().	2013-03-26 14:47:07 +01:00
Julius Volz	bdb067b47f	Implement remaining View Get* methods.	2013-03-21 18:11:04 +01:00
Julius Volz	6001d22f87	Change Get* methods to receive fingerprints instead of metrics.	2013-03-21 18:11:03 +01:00
Matt T. Proud	8cc5cdde0b	checkpoint.	2013-03-21 18:08:46 +01:00
Matt T. Proud	41068c2e84	Checkpoint.	2013-03-21 18:06:51 +01:00
Matt T. Proud	13ae29b304	Initial in-memory arena implementation. It is unbounded, and nothing uses it except for a gating flag in main.	2013-02-18 09:38:14 -06:00
Matt T. Proud	efbe0e8a12	Interface simplification. GetMetricForFingerprint(model.Fingerprint) (*Metric, error) -> GetMetricForFingerprint(model.Fingerprint) (Metric, error)	2013-02-14 08:43:02 -08:00
Matt T. Proud	f1245e8dda	Interface simplifications. GetFingerprintsForLabelName ([]*Fingerprint, error) -> GetFingerprintsForLabelName ([]Fingerprint, error)	2013-02-14 08:19:41 -08:00
Matt T. Proud	e8a733b525	Interface simplifications. GetFingerprintsForLabelSet ([]*Fingerprint, error) -> GetFingerprintsForLabelSet ([]Fingerprint, error)	2013-02-14 08:07:59 -08:00
Matt T. Proud	f03091b139	Interface simplifications: GetRangeValues From pointers to copies.	2013-02-13 21:11:23 -08:00
Matt T. Proud	14788cf4f3	Interface simplifications. GetBoundaryValues() from pointers to values.	2013-02-13 21:07:42 -08:00
Matt T. Proud	56f069b3ec	Interface simplifications: GetValueAtTime(). Pointer arguments to copies.	2013-02-13 21:05:01 -08:00
Matt T. Proud	1e2d6c7418	GetFingerprintsForLabelName simplifications. ``MetricPersistence.GetFingerprintsForLabelName(l *model.LabelName)`` -> ``MetricPersistence.GetFingerprintsForLabelName(l model.LabelName)``	2013-02-13 17:44:35 -08:00
Matt T. Proud	900bb988c1	Simplifications of GetFingerprintsForLabelSet. ``MetricPersistence.GetFingerprintsForLabelSet(s *model.LabelSet)`` -> ``MetricPersistence.GetFingerprintsForLabelSet(s model.LabelSet)``.	2013-02-13 17:13:41 -08:00
Matt T. Proud	4fbcea73f5	MetricPersistence.AppendSample signature changes. ``MetricPersistence.AppendSample(*model.Sample)`` -> ``MetricPersistence.AppendSample(model.Sample)``.	2013-02-13 13:46:28 -08:00
Julius Volz	d67e4b9131	Address outstanding comments from PR/47 and other cleanups.	2013-02-07 11:38:01 +01:00
Julius Volz	2659304380	Add interface for getting all metrics from a MetricsPersistence.	2013-02-06 17:06:39 +01:00
Matt T. Proud	ea54751431	Update import paths to new location. This repository moved from matttproud/prometheus to prometheus/prometheus, and all import paths need to be updated.	2013-01-27 18:49:45 +01:00
Matt T. Proud	3ac5d48b1a	Impl' storage i'faces and fix non-idiomatic warts. This change includes implementation of most major storage layer features, albeit some imperfect. It also includes nascent telemetry bindings, too.	2013-01-04 10:39:38 +01:00
Matt T. Proud	a14dbd5bd0	Interim commit for Julius.	2012-12-19 20:34:54 +01:00
Matt T. Proud	0886592ebc	New interface definition after discussion.	2012-12-12 12:53:34 +01:00
Matt T. Proud	59a708f25a	Provide prototype of storage layer interfaces.	2012-12-12 12:13:27 +01:00
Matt T. Proud	6589fc92f8	Strip web services, which weren't adding value.	2012-12-12 12:04:46 +01:00
Matt T. Proud	577acf4fe7	Exploding the storage infrastructure by contexts.	2012-12-09 16:27:12 +01:00
Matt T. Proud	15a6681651	Various cleanups. Kill LevelDB watermarks due to redundancy. General interface documentation has begun. Creating custom types for the model to prevent errors down the road. Renaming of components for easier comprehension. Exposition of interface in LevelDB. Slew of simple refactorings.	2012-12-08 14:03:08 +01:00
Matt T. Proud	044a5b4e14	Add nascent Travis CI configuration.	2012-11-29 20:00:09 +01:00
Matt T. Proud	6ed67b02e2	Incorporate new interface tests.	2012-11-28 18:52:04 +01:00
Matt T. Proud	44f8802ae7	Add Apache License 2.0 boilerplate.	2012-11-26 20:11:34 +01:00
Matt T. Proud	2bbdaa5790	Initial directory re-arrangement for storage.	2012-11-26 19:56:51 +01:00

48 commits