Well, well. Kind of surprised to see this really good tool that should have been...

mappu · 2025-10-06T21:58:11 1759787891

Specialization for file formats is not novel (e.g. 7-Zip uses BCJ2 prefiltering to convert the relative targets of x86 CALL/JMP instructions to absolute addresses, so repeated references to the same target compress better), nor is embedding specialized decoder bytecode in the archive (e.g. ZPAQ did this and won a lot of Matt Mahoney's benchmarks), but I think OpenZL's execution here, along with the data description and training system, is really fantastic.
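
For anyone who hasn't seen the prefiltering trick, here is a rough sketch of the idea only, not 7-Zip's actual BCJ2 code, which is considerably more involved. It assumes every 0xE8 byte is a CALL rel32 opcode and rewrites its 4-byte operand as an absolute offset, so two calls to the same function from different places end up as identical bytes. The names `e8_encode` and `e8_decode` are made up for this example.

```python
import struct


def _e8_filter(data: bytes, encode: bool) -> bytes:
    """Toy x86 CALL (0xE8) filter; hypothetical helper, not 7-Zip's BCJ2."""
    buf = bytearray(data)
    i = 0
    while i + 5 <= len(buf):
        if buf[i] == 0xE8:
            # Operand is a little-endian 32-bit displacement, relative to
            # the address of the next instruction (i + 5).
            (value,) = struct.unpack_from("<I", buf, i + 1)
            next_ip = i + 5
            if encode:
                value = (value + next_ip) & 0xFFFFFFFF  # relative -> absolute
            else:
                value = (value - next_ip) & 0xFFFFFFFF  # absolute -> relative
            struct.pack_into("<I", buf, i + 1, value)
            i += 5  # skip the operand so both directions scan identically
        else:
            i += 1
    return bytes(buf)


def e8_encode(data: bytes) -> bytes:
    return _e8_filter(data, encode=True)


def e8_decode(data: bytes) -> bytes:
    return _e8_filter(data, encode=False)


# Two calls to the same target (offset 0x100) from offsets 0 and 8.
code = (
    b"\xE8" + struct.pack("<I", 0x100 - 5)
    + b"\x90" * 3
    + b"\xE8" + struct.pack("<I", 0x100 - 13)
)
filtered = e8_encode(code)
```

Before filtering the two operands differ (0xFB vs 0xF3); afterwards both are 0x100, which gives the downstream compressor a repeated byte sequence to match. The filter is reversible because the scan only ever makes decisions on opcode bytes, which it never modifies.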

nunobrito · 2025-10-06T22:57:44 1759791464

Thanks. I've enjoyed reading more about ZPAQ. Their main focus seems to be versioning (quite a useful feature too, I'll try it later), but they don't include specialized compression per context.

Like you mention, the expandability is quite something. In a few years we might see a very capable compressor.