latency – Tarik Billa

What is the difference between latency and response time?

September 24, 2023 by Tarik

One way of looking at this is to say that transport latency + processing time = response time. Transport latency is the time it takes for a request/response to be transmitted to/from the processing component. Then you need to add the time it takes to process the request. As an example, say that 5 people … Read more
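As a rough illustration of the sum described above, here is a minimal sketch. The function name `response_time_ms` and the sample numbers are hypothetical, chosen only for illustration, and it assumes transport latency can be split into a request leg and a response leg.

```python
def response_time_ms(request_transit_ms, processing_ms, response_transit_ms):
    """Response time as seen by the caller: transport latency plus processing time.

    Transport latency is modelled here as two legs (request in, response out);
    that split is an assumption made for illustration.
    """
    transport_latency_ms = request_transit_ms + response_transit_ms
    return transport_latency_ms + processing_ms


# Hypothetical numbers: 20 ms each way on the wire, 150 ms of processing.
total = response_time_ms(20, 150, 20)
print(total)
```

With these made-up values, 40 ms of the total is transport latency and the remaining 150 ms is processing, which is the distinction the excerpt draws.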

How to prefetch data using a custom python function in tensorflow

August 1, 2023 by Tarik

This is a common use case, and most implementations use TensorFlow’s queues to decouple the preprocessing code from the training code. There is a tutorial on how to use queues, but the main steps are as follows: Define a queue, q, that will buffer the preprocessed data. TensorFlow supports the simple tf.FIFOQueue that produces elements … Read more

Fastest technique to pass messages between processes on Linux?

June 1, 2023 by Tarik

Whilst all the above answers are very good, I think we’d have to discuss what is “fastest” [and does it have to be “fastest” or just “fast enough for “?] For LARGE messages, there is no doubt that shared memory is a very good technique, and very useful in many ways. However, if the messages … Read more

What is the difference between latency, bandwidth and throughput?

May 24, 2023 by Tarik

Water Analogy: Latency is the amount of time it takes to travel through the tube. Bandwidth is how wide the tube is. The rate of water flow is the throughput.

Vehicle Analogy: Vehicle travel time from source to destination is latency. Types of roadways are bandwidth. Number of vehicles traveling is throughput.

preconnect vs dns-prefetch resource hints

May 11, 2023 by Tarik

I’ve been researching the topic a bit lately and so far my (theoretical) conclusions are as follows:

Browser support is high for both as of November 2022, when counting the real global usage of browsers (~94% vs ~83%).

dns-prefetch = DNS and preconnect = DNS + TCP + TLS. Note that DNS lookup is quite … Read more

What do we mean by “top percentile” or TP based latency?

May 2, 2023 by Tarik

tp90 is the maximum time under which 90% of requests have been served. Imagine you have these times: 10s, 1000s, 100s, 2s. Calculating TP is very simple: sort all times in ascending order: [2s, 10s, 100s, 1000s], then find the last item in the portion you need to calculate. For TP50 that portion is ceil(4*.5)=2 requests, so you need the 2nd request. … Read more
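The sort-then-pick steps above can be sketched as follows. This is a minimal sketch of that "ceil of the portion" rule only, not a general-purpose percentile routine; the function name `tp` is hypothetical, and the percentile is assumed to be an integer between 1 and 100.

```python
def tp(times, percent):
    """Return the TP<percent> value: the time under which `percent`% of
    requests were served, using the sort-and-take-the-ceil-th-item rule.

    `percent` is assumed to be an integer in 1..100.
    """
    if not times:
        raise ValueError("need at least one measurement")
    if not 1 <= percent <= 100:
        raise ValueError("percent must be between 1 and 100")

    ordered = sorted(times)
    # ceil(n * percent / 100) done in integer arithmetic to avoid float rounding.
    rank = -(-len(ordered) * percent // 100)
    # rank is 1-based (the "2nd request"), list indices are 0-based.
    return ordered[rank - 1]


times_s = [10, 1000, 100, 2]   # the sample times from the text, in seconds
print(tp(times_s, 50))         # 2nd of 4 sorted values
print(tp(times_s, 90))         # ceil(3.6) = 4th of 4 sorted values
```

For the four sample times, TP50 picks the 2nd sorted value (10s), which matches the calculation in the text, and TP90 picks the 4th (1000s).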

Why does Monitor.PulseAll result in a “stepping stair” latency pattern in signaled threads?

April 29, 2023 by Tarik

One difference between these versions is that in the PulseAll case the threads immediately repeat the loop, locking the object again. You have 12 cores, so 12 threads are running, execute the loop, and enter the loop again, locking the object (one after another) and then entering the wait state. All that time the other threads … Read more

How does Amazon RDS backup/snapshot actually work?

January 7, 2023 by Tarik

fastest (low latency) method for Inter Process Communication between Java and C/C++

January 2, 2023 by Tarik

Just tested latency from Java on my Core i5 2.8GHz, only a single byte sent/received, 2 Java processes just spawned, without assigning specific CPU cores with taskset:

TCP – 25 microseconds
Named pipes – 15 microseconds

Now explicitly specifying core masks, like taskset 1 java Srv or taskset 2 java Cli:

TCP, same cores: 30 microseconds
TCP, … Read more

Reducing garbage-collection pause time in a Haskell program

December 2, 2022 by Tarik

You’re actually doing pretty well to have a 51ms pause time with over 200MB of live data. The system I work on has a larger max pause time with half that amount of live data. Your assumption is correct: the major GC pause time is directly proportional to the amount of live data, and unfortunately … Read more