A new data compressor called ZX0

The place for codemasters or beginners to talk about programming any language for the Spectrum.
Alone Coder
Manic Miner
Posts: 401
Joined: Fri Jan 03, 2020 10:00 am

Re: A new data compressor called ZX0

Post by Alone Coder »

evilpaul wrote: Thu Sep 02, 2021 8:56 am
rastersoft wrote: Thu Sep 02, 2021 7:50 am Mmm... Is anybody interested in my text compression/decompression routines from Escape from M.O.N.J.A.S.? The output is about 59% of the original size, each string is individually addressable, and it can be decompressed on the fly without needing any kind of buffer. If there is interest, I can extract it and write up some instructions...
I need to do something with text compression for my Zelda-lite game. I have a bunch of ideas, but I would be interested to hear your approach. Maybe open another thread?
One simple approach is 5 bits per letter and 3 bits for a space. I've used this in a 128-byte intro. However, punctuation requires separate treatment.
Huffman encoding gives better results for larger amounts of data (in ZX-Guide #2 and #3, some of the articles were Huffman encoded, each line individually).
In ZX-Time magazine, all the text was stored at 5 bits per letter with some filtering on the result. One possible filter is to switch letter case after . ! ?
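To make the idea concrete, here is a rough Python sketch of 5-bits-per-letter packing (the alphabet mapping, padding and names are assumptions for illustration, not the routine from the intro or from ZX-Time; the 3-bit space code, punctuation handling and the case-switch filter are left out):

Code: Select all

# Hypothetical mapping: 0 = space, 1-26 = A-Z (5 bits each, packed MSB-first).
ALPHABET = " ABCDEFGHIJKLMNOPQRSTUVWXYZ"

def pack_5bit(text):
    bits = []
    for ch in text.upper():
        code = ALPHABET.index(ch)                     # punctuation would raise here
        bits.extend((code >> i) & 1 for i in range(4, -1, -1))
    while len(bits) % 8:                              # pad the final byte with zeros
        bits.append(0)
    out = bytearray()
    for i in range(0, len(bits), 8):
        byte = 0
        for b in bits[i:i + 8]:
            byte = (byte << 1) | b
        out.append(byte)
    return bytes(out)

print(len(pack_5bit("HELLO WORLD")))                  # 11 chars -> 55 bits -> 7 bytes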
Einar Saukas
Bugaboo
Posts: 3145
Joined: Wed Nov 15, 2017 2:48 pm

Re: A new data compressor called ZX0

Post by Einar Saukas »

Urusergi wrote: Wed Sep 01, 2021 8:03 pm This time I have achieved a more significant speed optimization, around 2% 8-)
Awesome, thank you!!!
rastersoft
Microbot
Posts: 151
Joined: Mon Feb 22, 2021 3:55 pm

Re: A new data compressor called ZX0

Post by rastersoft »

evilpaul wrote: Thu Sep 02, 2021 8:56 am
rastersoft wrote: Thu Sep 02, 2021 7:50 am Mmm... Is anybody interested in my text compression/decompression routines from Escape from M.O.N.J.A.S.? The output is about 59% of the original size, each string is individually addressable, and it can be decompressed on the fly without needing any kind of buffer. If there is interest, I can extract it and write up some instructions...
I need to do something with text compression for my Zelda-lite game. I have a bunch of ideas, but I would be interested to hear your approach. Maybe open another thread?
My system is based on LZSS (that is, encoding a distance-length pair for each repeated block). Decompression is lightning fast, but compression is another story. In fact, I did some research and there doesn't seem to be an optimal way of doing it. I do it in an iterative fashion, compressing the sentences, reordering them and trying again, keeping the best compression achieved up to that moment. It takes advantage of the fact that ASCII is 7 bits: if a byte has bit 7 set, it and the next byte form a distance-length pair (12 bits for the distance from the beginning of the compressed block, and three bits for the length, from 3 to 10 bytes).

The input is a source code file with each sentence in a DB statement, with a label immediately before each sentence. The output is a source code file too, with the same labels, but the DBs contain the compressed sentences. Each sentence must end in 0.
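As a rough illustration of the decoder side of this scheme, here is a Python sketch (the exact bit layout of the pair and the assumption that matches only ever cover runs of literal bytes are mine, so treat it as the general shape rather than the actual routines):

Code: Select all

# Assumed layout: flag in bit 7 of the first byte, then 12 bits of distance
# (measured from the start of the compressed block) and 3 bits of length.
def decode_sentence(block, start):
    out = bytearray()
    i = start
    while block[i] != 0:                       # each sentence ends in 0
        b = block[i]
        if b & 0x80:                           # distance-length pair
            pair = ((b & 0x7F) << 8) | block[i + 1]
            distance = pair >> 3               # 12-bit offset into the block
            length = (pair & 0x07) + 3         # 3..10 bytes
            out.extend(block[distance:distance + length])
            i += 2
        else:                                  # plain 7-bit ASCII literal
            out.append(b)
            i += 1
    return out.decode("ascii")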
rastersoft
Microbot
Posts: 151
Joined: Mon Feb 22, 2021 3:55 pm

Re: A new data compressor called ZX0

Post by rastersoft »

Alone Coder wrote: Thu Sep 02, 2021 10:09 am
evilpaul wrote: Thu Sep 02, 2021 8:56 am I need to do something with text compression for my Zelda-lite game. I have a bunch of ideas, but I would be interested to hear your approach. Maybe open another thread?
One simple approach is 5 bits per letter and 3 bits for a space. I've used this in a 128-byte intro. However, punctuation requires separate treatment.
Huffman encoding gives better results for larger amounts of data (in ZX-Guide #2 and #3, some of the articles were Huffman encoded, each line individually).
In ZX-Time magazine, all the text was stored at 5 bits per letter with some filtering on the result. One possible filter is to switch letter case after . ! ?
I tried that, but compression was not good.
Urusergi
Drutt
Posts: 16
Joined: Sun Aug 15, 2021 3:46 pm
Location: Madrid, Spain

Re: A new data compressor called ZX0

Post by Urusergi »

A crazy idea?

If we use a totally negative new offset (forwards), I think it would work, and pretty fast too, and the best thing is that we would stop using the alternate registers.

Code: Select all

; -----------------------------------------------------------------------------
; ZX0 decoder by Einar Saukas
; "Standard" version (69 bytes only)
; -----------------------------------------------------------------------------
; Parameters:
;   HL source address (compressed data)
;   DE destination address (decompressing)
; -----------------------------------------------------------------------------

	....
	....
	....
	
dzx0s_new_offset:
	inc	sp
	inc	sp
	ld	c, $ff
        call    dzx0s_elias_loop        ; obtain negative offset MSB
	inc	b
        ret     z                       ; check end marker
        ld      b, c
	ld      c, (hl)                 ; obtain negative offset LSB
        inc     hl
        rr      b                       ; last offset bit becomes first length bit
        rr      c
        push    bc                      ; preserve new offset
        ld      bc, 1                   ; obtain length
        call    nc, dzx0s_elias_backtrack
        inc     bc
        jr      dzx0s_copy
        
dzx0s_elias:
        inc	c	                ; interlaced Elias gamma coding
dzx0s_elias_loop:
        add     a, a
        jr      nz, dzx0s_elias_skip
        ld      a, (hl)                 ; load another group of 8 bits
        inc     hl
        rla
dzx0s_elias_skip:
        ret     c
dzx0s_elias_backtrack:
        add     a, a
        rl      c
        rl      b
        jr      dzx0s_elias_loop
Is it possible, or is there something wrong?
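As a side note for anyone following the assembly, the interlaced Elias gamma read loop (dzx0s_elias) boils down to a few lines of Python; this mirrors the logic only, not the register usage, and next_bit stands for whatever supplies the next bit from the compressed stream:

Code: Select all

def read_elias(next_bit):
    """Interlaced Elias gamma: start at 1 and interleave control and data bits."""
    value = 1
    while next_bit() == 0:                    # control bit: 0 = more data follows
        value = (value << 1) | next_bit()     # shift in the next data bit
    return value                              # a control bit of 1 ends the number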
catmeows
Manic Miner
Posts: 718
Joined: Tue May 28, 2019 12:02 pm
Location: Prague

Re: A new data compressor called ZX0

Post by catmeows »

rastersoft wrote: Thu Sep 02, 2021 3:50 pm
Alone Coder wrote: Thu Sep 02, 2021 10:09 am One simple approach is 5 bits per letter and 3 bits for a space. I've used this in a 128-byte intro. However, punctuation requires separate treatment.
Huffman encoding gives better results for larger amounts of data (in ZX-Guide #2 and #3, some of the articles were Huffman encoded, each line individually).
In ZX-Time magazine, all the text was stored at 5 bits per letter with some filtering on the result. One possible filter is to switch letter case after . ! ?
I tried that, but compression was not good.
The best simple text compression is Byte Pair Encoding (https://en.wikipedia.org/wiki/Byte_pair_encoding). It is dead simple, it allows decoding just a single message, and it still thinks globally (which is why it usually beats LZ77 variants that compress standalone messages).
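For reference, a compact Python sketch of the idea (illustrative only: the free-code budget, tie-breaking and dictionary format are assumptions, not the tool that produced the log below; the recursive expansion mirrors the Z80 decoder posted further down the thread):

Code: Select all

from collections import Counter

def bpe_compress(data: bytes, free_codes):
    """Repeatedly replace the most frequent byte pair with an unused code."""
    seq = list(data)
    table = {}                                    # code -> (left, right)
    for code in free_codes:
        pairs = Counter(zip(seq, seq[1:]))        # count adjacent byte pairs
        if not pairs:
            break
        (a, b), count = pairs.most_common(1)[0]
        if count < 2:
            break                                 # nothing left worth replacing
        table[code] = (a, b)
        new_seq, i = [], 0
        while i < len(seq):
            if i + 1 < len(seq) and seq[i] == a and seq[i + 1] == b:
                new_seq.append(code)              # substitute the pair
                i += 2
            else:
                new_seq.append(seq[i])
                i += 1
        seq = new_seq
    return bytes(seq), table

def bpe_expand(code, table, out):
    """Recursively expand one code back into literal bytes."""
    if code in table:
        left, right = table[code]
        bpe_expand(left, table, out)
        bpe_expand(right, table, out)
    else:
        out.append(code)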

This is how about 9 KB of text (151 messages) from Pirates! packs when transcribed to upper case. In theory, messages with both cases would have a somewhat worse ratio.

Code: Select all

Message count 151
Messages are 9643 bytes long.
Registering literal W
Registering literal E
Registering literal L
Registering literal C
Registering literal O
Registering literal M
Registering literal  
Registering literal T
Registering literal "
Registering literal B
Registering literal A
Registering literal K
Registering literal F
Registering literal G
Registering literal n
Registering literal R
Registering literal U
Registering literal N
Registering literal I
Registering literal H
Registering literal S
Registering literal D
Registering literal P
Registering literal .
Registering literal Y
Registering literal ?
Registering literal ÿ
Registering literal ,
Registering literal :
Registering literal V
Registering literal (
Registering literal 1
Registering literal 5
Registering literal 6
Registering literal 0
Registering literal )
Registering literal 2
Registering literal 4
Registering literal '
Registering literal 8
Registering literal X
Registering literal $
Registering literal J
Registering literal !
Registering literal Q
Registering literal Z
Registering literal -
Registering literal @
Registering literal &
Registering literal i
best available pair with occurency 248 -> 69,32
Registering pair 69, 32. Space left: 205
Encoding
best available pair with occurency 171 -> 32,65
Registering pair 32, 65. Space left: 204
Encoding
best available pair with occurency 166 -> 79,85
Registering pair 79, 85. Space left: 203
And so on till....

Code: Select all

Final size: 4442(was 9643)
Proud owner of Didaktik M
Sol_HSA
Microbot
Posts: 162
Joined: Thu Feb 04, 2021 11:03 am
Location: .fi

Re: A new data compressor called ZX0

Post by Sol_HSA »

catmeows wrote: Mon Sep 06, 2021 5:47 pm The best simple text compression is Byte Pair Encoding (https://en.wikipedia.org/wiki/Byte_pair_encoding). It is dead simple, it allows decoding just a single message, and it still thinks globally (which is why it usually beats LZ77 variants that compress standalone messages).
"Alter Ego" on PC used a similar scheme, but instead of encoding byte pairs to a single byte the unused byte values referred to a string table (which was also stored). Since the amount of text in alter ego was significant, this worked out. So the string table might include things like "The a" or perhaps some frequently used longer words like "probably".

I can see how the byte pair thing is a killer on short strings, though.
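The decoder side of such a string-table scheme is tiny; in Python it might look like the sketch below (the table contents and the BASE value are made-up examples, not Alter Ego's actual data):

Code: Select all

TABLE = ["The a", " the ", "probably", "ing ", " you "]   # example entries only
BASE = 0x80                                                # unused byte values start here

def expand(data: bytes) -> str:
    out = []
    for b in data:
        if b >= BASE:
            out.append(TABLE[b - BASE])    # reference into the stored string table
        else:
            out.append(chr(b))             # plain ASCII literal
    return "".join(out)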
rastersoft
Microbot
Posts: 151
Joined: Mon Feb 22, 2021 3:55 pm

Re: A new data compressor called ZX0

Post by rastersoft »

catmeows wrote: Mon Sep 06, 2021 5:47 pm The best simple text compression is Byte Pair Encoding (https://en.wikipedia.org/wiki/Byte_pair_encoding). It is dead simple, it allows decoding just a single message, and it still thinks globally (which is why it usually beats LZ77 variants that compress standalone messages).

...
The problem with that algorithm is that it is "recursive": you must process the data over and over, replacing one occurrence each pass, and then undo each pass during decompression. Thus you can't really do decompression "on-the-fly".
Einar Saukas
Bugaboo
Posts: 3145
Joined: Wed Nov 15, 2017 2:48 pm

Re: A new data compressor called ZX0

Post by Einar Saukas »

Urusergi wrote: Mon Sep 06, 2021 3:29 pm A crazy idea?

If we use a totally negative new offset (forwards), I think it would work, and pretty fast too, and the best thing is that we would stop using the alternate registers.

...
Is it possible, or is there something wrong?
This is a great idea!!!

Except the math was wrong, but I think this will work:

Code: Select all

dzx0s_new_offset:
        inc     sp
        inc     sp
        ld      c, $fe
        call    dzx0s_elias_loop        ; obtain negative offset MSB
        inc     c
        ret     z                       ; check end marker
        ld      b, c
Unfortunately it will require modifying the ZX0 format, but it's probably worth it.

Anyway, I will double-check everything and run detailed tests. And if this change is approved, I will also implement a "backwards compatibility mode" so the current format will be supported too.

Thanks again!
catmeows
Manic Miner
Posts: 718
Joined: Tue May 28, 2019 12:02 pm
Location: Prague

Re: A new data compressor called ZX0

Post by catmeows »

rastersoft wrote: Mon Sep 06, 2021 7:19 pm The problem with that algorithm is that it is "recursive": you must process the data over and over, replacing one occurrence each pass, and then undo each pass during decompression. Thus you can't really do decompression "on-the-fly".
That's not true.

Code: Select all

	.MODULE depack_text
	
	;test: 4800 chars packed to 2079 bytes + 512 bytes dictionary -> 54% i.e. 4.3 bits per character

depack_text
	;HL callback
	;BC message id

	ld (_callback + 1), hl	;set callback

	ld hl, messages
	inc bc			;add one 
_find1
	dec bc			;decrease one
	ld a, b
	or c
	jr z, _depack		;message found ?
_find2
	ld a, (hl)		;scan for next message end
	inc hl
	cp 255
	jr nz, _find2
	jr _find1		;when found decrease message counter
_depack	
	ld a, (hl)		;get code
	cp 255			;end of message ?
	ret z			;leave
	push hl
	call _decode
	pop hl
	inc hl
	jr _depack

_decode	
	ld c, a
	ld b, 0
	ld hl, dictionary			
	add hl, bc
	add hl, bc		;find code in dictionary
	ld a, (hl)		;get first
	cp 255			;is it leaf?
	jr nz, _decode1
	inc hl
	ld a, (hl)		;get literal
_callback	
	jp 0			;apply callback
_decode1
	push hl			;not a leaf
	call _decode		;decode first (recursion)
	pop hl			
	inc hl
	ld a, (hl)
	jp _decode		;decode second (recursion) and leave		

And this is tested code - see the screenshot of a working crude menu system: viewtopic.php?p=22356#p22356
The only problem is the depth of the stack: in a pathological case you could go down to level 255. In practice it will never happen. For example, 32 spaces will be packed into 16 double spaces, 16 double spaces into 8 quad spaces, and 8 quad spaces into 4 oct spaces. The actual tree is pretty shallow: digraphs are level two, trigraphs level three, short common words level four, etc. And if you are paranoid, you can always let the compressor check the tree depth (for example when working on a console like the A2600, where you really need to keep an eye on the stack depth).

Basically, when a message is long enough to cause problems with the stack or even with performance, it is usually also long enough to pass the threshold where LZ would provide a better ratio. But here we are speaking about perhaps 1 KB+ texts, not about three-liners.
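A compressor-side depth check like the one mentioned above could look like this Python sketch, assuming the same dictionary layout as the Z80 decoder (two bytes per code, with a first byte of 255 marking a leaf that holds a literal):

Code: Select all

def code_depth(dictionary: bytes, code: int) -> int:
    """Worst-case expansion nesting for a single code."""
    first = dictionary[2 * code]
    if first == 255:
        return 1                          # leaf: a single literal
    second = dictionary[2 * code + 1]
    return 1 + max(code_depth(dictionary, first),
                   code_depth(dictionary, second))

def max_depth(dictionary: bytes, used_codes) -> int:
    """Reject dictionaries that would nest (and recurse) too deeply."""
    return max(code_depth(dictionary, c) for c in used_codes)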
Proud owner of Didaktik M
Urusergi
Drutt
Posts: 16
Joined: Sun Aug 15, 2021 3:46 pm
Location: Madrid, Spain

Re: A new data compressor called ZX0

Post by Urusergi »

You're right, brilliant!!

This means that B doesn't matter and we can optimize one more byte? :o

Code: Select all

dzx0s_new_offset:
	pop	bc
        ld      c, $fe
        call    dzx0s_elias_loop        ; obtain negative offset MSB
        inc     c
        ret     z                       ; check end marker
        ld      b, c
Einar Saukas
Bugaboo
Posts: 3145
Joined: Wed Nov 15, 2017 2:48 pm

Re: A new data compressor called ZX0

Post by Einar Saukas »

Awesome!
rastersoft
Microbot
Posts: 151
Joined: Mon Feb 22, 2021 3:55 pm

Re: A new data compressor called ZX0

Post by rastersoft »

catmeows wrote: Mon Sep 06, 2021 9:17 pm
rastersoft wrote: Mon Sep 06, 2021 7:19 pm The problem with that algorithm is that it is "recursive": you must process the data over and over, replacing one occurrence each pass, and then undo each pass during decompression. Thus you can't really do decompression "on-the-fly".
That's not true.
...
And this is tested code - see the screenshot of a working crude menu system: viewtopic.php?p=22356#p22356
The only problem is the depth of the stack: in a pathological case you could go down to level 255. In practice it will never happen. For example, 32 spaces will be packed into 16 double spaces, 16 double spaces into 8 quad spaces, and 8 quad spaces into 4 oct spaces.
Mmmm... I see what you mean... but precisely those pathological (and not so pathological) cases are the ones that worry me. In my case I had very little stack space because I used threads, each one with its own stack, so I had to be very careful about how much it grew.
Urusergi
Drutt
Posts: 16
Joined: Sun Aug 15, 2021 3:46 pm
Location: Madrid, Spain

Re: A new data compressor called ZX0

Post by Urusergi »

This could be the definitive standard forwards version (if the modifications work in the compressor).
It still occupies the same number of bytes as before (69, magic number :mrgreen:), but takes advantage of the speed optimizations of the standard backwards version.

This way the forwards version is slightly faster than the backwards version, I think.

Code: Select all

; -----------------------------------------------------------------------------
; ZX0 decoder by Einar Saukas
; "Standard" version (69 bytes only)
; -----------------------------------------------------------------------------
; Parameters:
;   HL: source address (compressed data)
;   DE: destination address (decompressing)
; -----------------------------------------------------------------------------

dzx0_standard:
        ld      bc, $ffff               ; preserve default offset 1
        push    bc
        inc     bc
        ld      a, $80
dzx0s_literals:
	inc	c
        call    dzx0s_elias             ; obtain length
        ldir                            ; copy literals
        add     a, a                    ; copy from last offset or new offset?
        jr      c, dzx0s_new_offset
	inc	c
        call    dzx0s_elias             ; obtain length
dzx0s_copy:
        ex      (sp), hl                ; preserve source, restore offset
        push    hl                      ; preserve offset
        add     hl, de                  ; calculate destination - offset
        ldir                            ; copy from offset
        pop     hl                      ; restore offset
        ex      (sp), hl                ; preserve offset, restore source
        add     a, a                    ; copy from literals or new offset?
        jr      nc, dzx0s_literals
dzx0s_new_offset:
	pop	bc
	ld	c, $fe
        call    dzx0s_elias             ; obtain offset MSB
	inc	c
        ret     z                       ; check end marker
        ld      b, c
        ld      c, (hl)                 ; obtain offset LSB
        inc     hl
        rr      b                       ; last offset bit becomes first length bit
        rr      c
        push    bc                      ; preserve new offset
        ld      bc, 1                   ; obtain length
        call    nc, dzx0s_elias_backtrack
        inc     bc
        jr      dzx0s_copy

dzx0s_elias_backtrack:
        add     a, a
        rl      c
        rl      b
dzx0s_elias:
        add     a, a			; interlaced Elias gamma coding
        jr      nz, dzx0s_elias_skip
        ld      a, (hl)                 ; load another group of 8 bits
        inc     hl
        rla
dzx0s_elias_skip:
        jr	nc, dzx0s_elias_backtrack
	ret
; -----------------------------------------------------------------------------
evilpaul
Manic Miner
Posts: 244
Joined: Tue Jul 28, 2020 8:04 am
Location: Espoo, Finland

Re: A new data compressor called ZX0

Post by evilpaul »

Some "progression" on the Python port. Timings for various versions (from my laptop) compressing 48.rom:

My port running on Python3: 93 seconds
My port running on PyPy3: 4 seconds
Original compiled C version: 3 seconds

Massive speed-up achieved simply by using PyPy3 (an optimised Python3 interpreter) instead of the stock interpreter. I'm happy with these timings now. I'll tidy up the code at (waves hands indistinctly) some time in the future and share it.
Einar Saukas
Bugaboo
Posts: 3145
Joined: Wed Nov 15, 2017 2:48 pm

Re: A new data compressor called ZX0

Post by Einar Saukas »

OK, it's done! The latest optimizations are now available in the official ZX0 repository:

https://github.com/einar-saukas/ZX0

IMPORTANT: ZX0 version 2 now uses a different compressed file format (technically, the value bits in Elias(MSB(offset)) are now inverted). This new format allows ZX0 decompressors in Z80 assembly to be slightly smaller and slightly faster. I believe this change will also benefit ZX0 decompressors on most other platforms. The old format from ZX0 version 1 is still supported by the compressor (using option "-c"), but it is now deprecated. If you are the author of a ZX0 decompressor port to another platform, please consider upgrading to ZX0 version 2!
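To illustrate what "value bits inverted" means here, a small Python sketch of an interlaced Elias gamma writer with an optional inversion flag (byte packing and the rest of the stream layout are omitted, and the MSB-first bit order is my reading of the decompressor, so treat this as an illustration rather than a format reference):

Code: Select all

def elias_bits(value: int, invert: bool = False):
    """Yield the interleaved control/data bits for value >= 1."""
    for d in bin(value)[3:]:                  # binary digits after the leading 1
        yield 0                               # control bit: more data follows
        bit = int(d)
        yield bit ^ 1 if invert else bit      # version 2 stores these data bits inverted
    yield 1                                   # control bit: end of number

print(list(elias_bits(5)))                    # [0, 0, 0, 1, 1]  (5 = binary 101)
print(list(elias_bits(5, invert=True)))       # [0, 1, 0, 0, 1]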

I have also updated the integrated ZX0+RCS decompressors to use the new format, and submitted the changes to the Boriel ZX BASIC library. I will post about it when the pull request gets approved.

If there are any questions, please let me know!
Urusergi
Drutt
Posts: 16
Joined: Sun Aug 15, 2021 3:46 pm
Location: Madrid, Spain

Re: A new data compressor called ZX0

Post by Urusergi »

Einar Saukas wrote: Thu Sep 09, 2021 6:49 pm OK, it's done! The latest optimizations are now available in the official ZX0 repository:
Bravo!!!

I'm really glad it works :D

Congratulations.

EDIT: You didn't like my last forward standard optimization? In exchange for one byte you get 2% extra speed. I found it interesting enough 8-)
Einar Saukas
Bugaboo
Posts: 3145
Joined: Wed Nov 15, 2017 2:48 pm

Re: A new data compressor called ZX0

Post by Einar Saukas »

Urusergi wrote: Thu Sep 09, 2021 8:36 pm Congratulations.
Thanks to your extremely creative suggestion!

You are properly mentioned in the credits :)

Urusergi wrote: Thu Sep 09, 2021 8:36 pm EDIT: You didn't like my last forward standard optimization? In exchange for one byte you get 2% extra speed. I found it interesting enough 8-)
Sorry, I forgot to post a reply to your last post.

Problem is, the "standard" decompressor is intended to be as small as possible. The idea of sacrificing size for better performance makes perfect sense, but it's exactly the reason we have other decompressor variants.
Urusergi
Drutt
Posts: 16
Joined: Sun Aug 15, 2021 3:46 pm
Location: Madrid, Spain

Re: A new data compressor called ZX0

Post by Urusergi »

Einar Saukas wrote: Thu Sep 09, 2021 9:12 pm Thanks to your extremely creative suggestion!

You are properly mentioned in the credits :)
Thank you very much :D
Einar Saukas wrote: Thu Sep 09, 2021 9:12 pm Sorry, I forgot to post a reply to your last post.

Problem is, the "standard" decompressor is intended to be as small as possible. The idea of sacrificing size for better performance makes perfect sense, but it's exactly the reason we have other decompressor variants.
No problem, I understand you perfectly. I much prefer optimizing for size rather than for speed (I'm really terrible at optimizing for speed :? that's why I don't dare touch the "mega" .asm files).
Anyway, the new standard decompressor is about 1% faster than the previous one 8-)
Urusergi
Drutt
Posts: 16
Joined: Sun Aug 15, 2021 3:46 pm
Location: Madrid, Spain

Re: A new data compressor called ZX0

Post by Urusergi »

Another little optimization. This time for the fast 2.0 routine:

Code: Select all

        ld      b, &ff                  ; the top byte of the offset is always $FF
At this point BC is always 0, so it can safely be changed to:

Code: Select all

	dec	b
On the other hand we have a lonely "jp":

Code: Select all

        jp      nz, CopyMatch1
I think it was a "jr" in the past, but the source code was increasing in size until reaching out the range and was replaced by a "jp"? because all my tests indicate a slight improvement in speed now that we can change it to "jr" again.

There is also a misspelled label: LongerOffets

Code: Select all

        jr      nc, LongerOffets
        jr      nz, ShorterOffsets      ; NZ after NC == "confirmed C"
        
        ld      a, (hl)                 ; reload bits
        inc     hl
        rla

        jr      c, ShorterOffsets

LongerOffets:
Urusergi
Drutt
Posts: 16
Joined: Sun Aug 15, 2021 3:46 pm
Location: Madrid, Spain

Re: A new data compressor called ZX0

Post by Urusergi »

I can't find any more optimizations, so this afternoon I started thinking about a possible evolution of the format, and again I had a new crazy idea.

What if we don't discard the last offset and instead use the stack as a combined stack & buffer? I'm not sure whether this can improve compression, but I had nothing else to do :P so I created this forward standard decompressor mod:

Code: Select all

; -----------------------------------------------------------------------------
; ZXX decoder by Einar Saukas & Urusergi
; "Standard" version (93 bytes only)
; -----------------------------------------------------------------------------
; Parameters:
;   HL: source address (compressed data)
;   DE: destination address (decompressing)
; -----------------------------------------------------------------------------
dzxx_standard:
	pop	iy			; safeguard the return of the routine
        ld      bc, $ffff               ; preserve default offset 1
        push    bc
        inc     bc
        ld      a, $80
dzxxs_literals:
        call    dzxxs_elias             ; obtain length
        ldir                            ; copy literals
        add     a, a                    ; copy from old offset or new offset?
        jr      c, dzxxs_new_offset

        call    dzxxs_elias             ; obtain buffer index
	push	bc
	pop	ix
	add	ix, ix			; x2 being 16 bits
	add	ix, sp			; ix points to old offset LSB
	ld	c, (ix + 0)
	ld	b, (ix + 1)
	push	bc			; buffer updated
	ld	bc, 1
        call    dzxxs_elias             ; obtain length
	
dzxxs_copy:
        ex      (sp), hl                ; preserve source, restore offset
        push    hl                      ; preserve offset
        add     hl, de                  ; calculate destination - offset
        ldir                            ; copy from offset
        pop     hl                      ; restore offset
        ex      (sp), hl                ; preserve offset, restore source
        add     a, a                    ; copy from literals or new offset?
        jr      nc, dzxxs_literals
dzxxs_new_offset:
        ld      c, $fe                  ; prepare negative offset
        call    dzxxs_elias_loop        ; obtain offset MSB
        inc     c
        jr      z, dzxx_exit            ; check end marker
        ld      b, c
        ld      c, (hl)                 ; obtain offset LSB
        inc     hl
        rr      b                       ; last offset bit becomes first length bit
        rr      c
        push    bc                      ; preserve new offset
        ld      bc, 1                   ; obtain length
        call    nc, dzxxs_elias_backtrack
        inc     bc
        jr      dzxxs_copy
dzxxs_elias:
        inc     c                       ; interlaced Elias gamma coding
dzxxs_elias_loop:
        add     a, a
        jr      nz, dzxxs_elias_skip
        ld      a, (hl)                 ; load another group of 8 bits
        inc     hl
        rla
dzxxs_elias_skip:
        ret     c
dzxxs_elias_backtrack:
        add     a, a
        rl      c
        rl      b
        jr      dzxxs_elias_loop
dzxx_exit:
	push	iy
	ret
; -----------------------------------------------------------------------------
Naturally, the stack will grow quite a bit, so the hypothetical compressor should report the final size of the buffer so it can be kept under control.
 
Einar Saukas
Bugaboo
Posts: 3145
Joined: Wed Nov 15, 2017 2:48 pm

Re: A new data compressor called ZX0

Post by Einar Saukas »

Urusergi wrote: Sat Sep 18, 2021 11:26 pm I can't find any more optimizations, so this afternoon I started thinking about a possible evolution of the format, and again I had a new crazy idea.
That's a very original idea, but probably too crazy. :)

I'm not convinced it would give good enough compression, and the stack growth could make it impractical.

But if you want to test it, go ahead! Except I suggest replacing POP IY ... PUSH IY/RET with LD (NN),SP ... LD SP,0/RET, otherwise the caller would get a stack full of garbage.
Einar Saukas
Bugaboo
Posts: 3145
Joined: Wed Nov 15, 2017 2:48 pm

Re: A new data compressor called ZX0

Post by Einar Saukas »

Urusergi wrote: Tue Sep 14, 2021 9:33 pm Another little optimization. This time for the fast 2.0 routine:

Code: Select all

        ld      b, &ff                  ; the top byte of the offset is always $FF
At this point BC is always 0
There's an execution path with LDIR followed by LDI, so I think you could have BC=$ffff at this point... Since the "fast" decompressor is provided by introspec and uniabis, it's up to them to evaluate your suggestions.

Anyway thanks again!
Urusergi
Drutt
Posts: 16
Joined: Sun Aug 15, 2021 3:46 pm
Location: Madrid, Spain

Re: A new data compressor called ZX0

Post by Urusergi »

Einar Saukas wrote: Sun Sep 19, 2021 5:51 am That's a very original idea, but probably too crazy. :)
:lol: Yeah, sometimes I get so bored that these things come to me. However, to test the idea I would need to learn C programming first :oops:
Einar Saukas wrote: Sun Sep 19, 2021 6:02 am There's an execution path with LDIR followed by LDI, so I think you could have BC=$ffff at this point... Since the "fast" decompressor is provided by introspec and uniabis, it's up to them to evaluate your suggestions.

Anyway thanks again!
Yes, but there is an <inc c> which ensures that the last LDI decrements BC to 0

Code: Select all

        ; because BC>=3-1, we can do 2 x LDI safely
        ldi
        ldir
        inc     c
        ldi
        pop     hl                      ; restore source

        ; after a match you can have either
        ; 0 + <elias length> = run of literals, or
        ; 1 + <elias offset msb> + [7-bits of offset lsb + 1-bit of length] + <elias length> = another match
AfterMatch3:
        add     a, a
        jr      c, UsualMatch
Einar Saukas
Bugaboo
Posts: 3145
Joined: Wed Nov 15, 2017 2:48 pm

Re: A new data compressor called ZX0

Post by Einar Saukas »

Urusergi wrote: Sun Sep 19, 2021 8:31 am Yes, but there is an <inc c> which ensures that the last LDI decrements BC to 0
You are right! Sorry I missed it.