-
Updated
Mar 11, 2022
site-reliability-engineering
Chaos engineering is the discipline of experimenting on a software system in production in order to build confidence in the system's capability to withstand turbulent and unexpected conditions. Chaos engineering is a disciplined approach to identifying failures before they become outages
Here are 62 public repositories matching this topic...
-
Updated
Apr 13, 2022 - JavaScript
-
Updated
Mar 10, 2022
Capability Questions
Issue Description
Question
Describe what happened (or what feature you want)
Trying to evaluate ChaosBlade as an option for resiliency testing. But I'm not sure if this is a feature request or a question. Actually, two questions:
- Does ChaosBlade support Azure, or can it be extended to support Azure?
- Can ChaosBlade inject failures into a Platform as a Service (Pa
We can add an upgrade/downgrade command for litmusctl binary, it can look at the matrix of versions in a file and upgrade/downgrade according to the user's choice.
example
- litmusctl upgrade v0.5.0
- litmusctl downgrade v0.4.0
-
Updated
Mar 23, 2022 - Go
It seems to me that UTC is selected for on the wire representation of time as well as in the database (jaegertracing/jaeger#712), which sort of makes sense, at least with a somewhat naive handling of timezones. However, I think that the Jaeger UI should support displaying times in the timezone local to the user, i.e. of the browser as to reduce the mental load when viewing
-
Updated
Mar 7, 2022
-
Updated
Nov 8, 2021
-
Updated
Jun 16, 2020
-
Updated
May 6, 2020
-
Updated
Mar 22, 2017
-
Updated
Dec 10, 2018 - TeX
-
Updated
May 6, 2018
Although it's not a high priority, we could get a more fancy and modern wheel.
-
Updated
May 23, 2020 - Go
-
Updated
Feb 14, 2021 - HTML
-
Updated
Nov 27, 2019
-
Updated
Apr 16, 2022
-
Updated
Mar 23, 2022 - Makefile
-
Updated
Aug 24, 2018
-
Updated
Feb 20, 2022
-
Updated
Jan 16, 2022
-
Updated
Sep 13, 2021
This is a rewrite of #129 to make it easier to parse :-)
Background
Prometheus is de-facto standard for monitoring applications in the cloud native space. One of the core conceits here is the idea of "time-series" data (look at the Prometheus docs to get a better idea) for metrics. At a high level, you can just think of it as a continues series of values for som
-
Updated
Nov 25, 2020 - Ruby
-
Updated
Feb 2, 2017
-
Updated
May 17, 2018
The library
controller-runtimerequires setting a logger (bylog.SetLogger()) at the first 30s when the application starts, or it would use the defaultNullLogSink. We should also call it in testing codes.When we test with ginkgo, ginkgo provides a helpful
GinkgoWriter, which hides the output as default, only prints it when the test failed. We'd better use it to keep our testing output