What is the current state of text-only compression algorithms?

The boundary-pushing compressors combine several algorithms to reach very high compression ratios. Common algorithms include: The Burrows-Wheeler Transform: shuffles characters (or other bit blocks) with a predictable algorithm to increase repeated blocks, which makes the source easier to compress. Decompression occurs as normal and the result is un-shuffled with the reverse transform. Note: BWT alone doesn’t … Read more
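
To make the shuffle and un-shuffle idea concrete, here is a minimal sketch of a naive Burrows-Wheeler transform and its inverse. It is not taken from the original answer: the function names and the `"\0"` end marker are assumptions chosen for illustration, and real compressors use suffix arrays rather than sorting every rotation.

```python
def bwt(text: str, end: str = "\0") -> str:
    """Naive BWT: sort all rotations of text + end marker, keep the last column."""
    if end in text:
        raise ValueError("end marker must not occur in the input")
    s = text + end
    rotations = sorted(s[i:] + s[:i] for i in range(len(s)))
    return "".join(rotation[-1] for rotation in rotations)


def inverse_bwt(transformed: str, end: str = "\0") -> str:
    """Rebuild the sorted rotation table one column at a time, then pick
    the row that ends with the end marker."""
    n = len(transformed)
    table = [""] * n
    for _ in range(n):
        # Prepend the transformed column to every row and re-sort.
        table = sorted(transformed[i] + table[i] for i in range(n))
    original = next(row for row in table if row.endswith(end))
    return original[:-1]


if __name__ == "__main__":
    shuffled = bwt("banana")
    print(repr(shuffled))
    print(inverse_bwt(shuffled))
```

The transform itself does not shrink anything; it only tends to group identical characters together so that a later compression stage has longer runs to work with.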

h264 lossless coding

I am going to add a late answer to this one after spending all day trying to figure out how to get YUV 4:4:4 pixels into x264. While x264 does accept raw 4:2:0 pixels in a file, it is really quite difficult getting 4:4:4 pixels passed in. With recent versions of ffmpeg, the following works … Read more

Why does base64-encoded data compress so poorly?

Most generic compression algorithms work with a one-byte granularity. Let’s consider the following string: “XXXXYYYYXXXXYYYY” A Run-Length-Encoding algorithm will say: “that’s 4 ‘X’, followed by 4 ‘Y’, followed by 4 ‘X’, followed by 4 ‘Y’” A Lempel-Ziv algorithm will say: “That’s the string ‘XXXXYYYY’, followed by the same string: so let’s replace the 2nd string … Read more
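
As a rough illustration of the byte-granularity point, the sketch below run-length encodes the example string and then shows that the same bytes produce different base64 text when they start at a different offset. The helper name `run_length_encode` and the sample inputs `b"hello"` and `b"xhello"` are assumptions made for this example, not part of the original answer.

```python
import base64
from itertools import groupby


def run_length_encode(text: str) -> list[tuple[str, int]]:
    """Collapse each run of identical characters into a (char, count) pair."""
    return [(char, len(list(run))) for char, run in groupby(text)]


if __name__ == "__main__":
    # "4 X, then 4 Y, then 4 X, then 4 Y"
    print(run_length_encode("XXXXYYYYXXXXYYYY"))

    # Base64 packs 3 input bytes into 4 output characters, so shifting the
    # input by one byte changes how every following byte is split up.
    aligned = base64.b64encode(b"hello").decode("ascii")
    shifted = base64.b64encode(b"xhello").decode("ascii")
    print(aligned)
    print(shifted)
```

A compressor that looks for repeated byte sequences can therefore miss repeats in the encoded text that were obvious in the raw bytes.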

Is JPEG lossless when quality is set to 100?

As correctly answered above, using a “typical” JPEG encoder at quality 100 does not give you lossless compression. Lossless JPEG encoding exists, but it’s different in nature and seldom used. I’m just posting to say why quality 100 does not mean lossless. In JPEG compression, information is mostly lost during the DCT coefficient quantization step … Read more
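
The quantization step mentioned above can be sketched in a few lines: a real-valued DCT coefficient is divided by a step size and rounded to an integer, and the decoder can only multiply back. The coefficient value 37.6 and the step sizes below are made-up numbers for illustration, and the function names are hypothetical; this is not code from any particular JPEG encoder.

```python
def quantize(coefficient: float, step: int) -> int:
    """Divide by the quantization step and round to the nearest integer."""
    return round(coefficient / step)


def dequantize(level: int, step: int) -> int:
    """The decoder can only recover a multiple of the step size."""
    return level * step


if __name__ == "__main__":
    coefficient = 37.6  # hypothetical DCT coefficient
    for step in (16, 1):
        restored = dequantize(quantize(coefficient, step), step)
        print(f"step={step}: {coefficient} -> {restored}")
```

Even with a step size of 1, the fractional part of the coefficient is rounded away, which is one way to see why a small step size alone does not make the round trip exact.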
