Zipkin is a distributed tracing system. It helps gather timing data needed to troubleshoot latency problems in microservice architectures. It manages both the collection and lookup of this data through a Collector and a Query service. Zipkin’s design is based on the Google Dapper paper.
Collecting traces helps developers gain deeper knowledge about how certain requests perform in a distributed system. Let’s say we’re having problems with user requests timing out. We can look up traced requests that timed out and display it in the web UI. We’ll be able to quickly find the service responsible for adding the unexpected response time. If the service has been annotated adequately we can also find out where in that service the issue is happening.
nload is a console application which monitors network traffic and bandwidth usage in real time. It visualizes the in- and outgoing traffic using two graphs and provides additional info like the total amount of transfered data and min/max network usage.
Slurm (version 0.4.0) shows network activity for a given interface. Total uploads and downloads are shown together in text and ASCII graphics. Data updates every second by default, and the graph scrolls from left to right according to network usage.
I was looking for some monitoring tool…
I’v signed up on Datadog… why?
It has some awesome clients and the free account handles 5 hosts…
It took me less than 5 minutes to have it working and pumping graphics!