Packetbeat Blog @packetbeat - Tumblr Blog

Packetbeat Blog

@packetbeat

Packetbeat is a service for monitoring and troubleshooting web applications. It sniffs the traffic from your network and helps DevOps by giving them real-time visibility into how the various applications and their interfaces are performing. Sign up to get visibility in your network!

2 posts

How to troubleshoot distributed systems

More and more Internet services need to have a complex distributed architecture in order to handle a large number of clients and remain available in high load conditions. These applications are usually a collection of smaller or larger software entities, most often running on many different physical servers, each having its own purpose. Quite often, these components are written by different teams, using different frameworks and programming languages. Understanding the behavior of such a system requires observing the communication between the entities that are running on different servers.

As an example, let’s consider a simple blogging platform architecture When a user wants to see the latest posts, then the browser generates a GET /posts/ request and sends it to the web server, that forwards the HTTP request to one of the web app servers. The web app interrogates the database to get a list of the latest posts and encodes them to send them back in a HTTP response to the twitter web server that relays it to the user. Everything looks clean and easy, but what happens in case the user gets back an error instead of the list of posts? The issue can be anywhere on the path from the user’s browser down to the database.

To track down the issue, the first idea that you have is probably to check the logs of each server involved in the process of getting the posts and look for exceptions. In case you are unlucky and you don’t find anything in the logs, then you try reproducing the issue while the services are running in debug level and tracing on each server. After hours of debugging, you find out that the issue was a race condition between the cleanup process and the database. Indeed this is a nasty bug that requires a lot of investigation. But what if you would have a system to show you all the transactions between all the servers involved in the process of getting the posts? What if this system would show you the root cause in couple of seconds?

#troubleshoot distributedsystems

•18+ Adults Only

Watch Anya Live on Cam

Anya is live and ready to show you everything. Watch her strip, dance, and perform exclusive shows just for you. Interact in real-time and make your fantasies come true.

✓ Live Streaming✓ Interactive Chat✓ Private Shows✓ HD Quality✓ Free Actions

Free to watch • No registration required • HD streaming

Welcome

Hi everyone. Welcome to Packetbeat Blog. We are a startup base in Berlin, Germany. At Packetbeat, we believe that Operations teams can reach the highest standards of reliability and efficiency only if they are data driven and if they consider visibility, transparency and collaboration as their core values. Our mission is to create products that help DevOps and Ops teams troubleshoot and monitor the applications for which they are responsible.

Stay tuned. More info to come.

#webmonitoring #troubleshoot web

Trending Blogs

Last Seen Blogs

Packetbeat Blog