The earliest example of neural network data compression that I am aware
of is Schmidhuber, Jürgen, and Stefan Heil (1996), "Sequential neural
text compression", IEEE Trans. on Neural Networks 7(1): 142-146. They
trained a 3-layer network by backpropagation to predict the next
character given a 5-character context as input, and coded the
prediction using arithmetic coding. They improved a bit on UNIX
"compress" but it took days of CPU to train the model on just a few KB
of text. Also, back propagation of multi-layer networks requires
multiple training passes, so it was offline. They only compressed text
(in German), so their alphabet was size 80 rather than 256.
In 2000 I wrote some neural network compressors (P5, P6, P12) fast
enough to be practical, and a paper:
cs.fit.edu/~mmahoney/compression/nn_paper.html
The compression is better than gzip and the Schmidhuber-Heil system
but not as good as PPM. The major improvements were to eliminate
multiple training passes to make it online, to train only the last
layer, to treat the input as a stream of bits rather than bytes (to
speed up the range coder), and to keep a 0/1 count history with each
weight to control the learning rate. Later I dropped the weights and
just kept the 0/1 counts, and the result was PAQ.
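Roughly, each context keeps a pair of bit counts from which both the
prediction and the effective learning rate follow. The sketch below is
only an illustration (my own naming, not the actual PAQ code):

// Sketch of a per-context bit predictor built on 0/1 counts.
#include <cstdint>

struct BitCounter {
    uint32_t n0 = 0, n1 = 0;    // 0 and 1 bits seen so far in this context

    // Probability that the next bit is a 1 (add-1/2 smoothing).
    double p1() const { return (n1 + 0.5) / (n0 + n1 + 1.0); }

    // Record the actual bit.  The effective learning rate, 1/(n0+n1+1),
    // shrinks automatically as evidence accumulates in the context.
    void update(int bit) { if (bit) ++n1; else ++n0; }
};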
One of these days I will write PAQ7 and it will probably use neural
networks to combine predictions from different models. I have been
getting some good results combining predictions using:
p = squash(SUM_i(w_i * stretch(p_i)))
where p_i is the probability that the next bit will be a 1 according
to the i'th model, stretch(p) = ln(p/(1-p)), squash(x) = 1/(1 + e^-x)
(the inverse of stretch), and p is the final output to the range coder.
The weights w_i are updated using normal backpropagation:
w_i += r * p * (1-p) * (y - p)
where y is the actual bit and r (about .001 to .01) is the learning
rate.
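As a sketch of that mixer (illustrative code, not PAQ7; I also scale
each weight's update by its input stretch(p_i), the factor that full
backpropagation supplies):

// Sketch of the logistic mixer described above.
#include <cmath>
#include <cstddef>
#include <vector>

double stretch(double p) { return std::log(p / (1.0 - p)); }    // ln(p/(1-p))
double squash(double x)  { return 1.0 / (1.0 + std::exp(-x)); } // inverse of stretch

struct Mixer {
    std::vector<double> w;   // one weight per model
    double r;                // learning rate, about .001 to .01

    Mixer(std::size_t n, double rate = 0.005) : w(n, 0.0), r(rate) {}

    // p = squash(SUM_i(w_i * stretch(p_i)))
    double mix(const std::vector<double>& p) const {
        double x = 0.0;
        for (std::size_t i = 0; i < w.size(); ++i) x += w[i] * stretch(p[i]);
        return squash(x);
    }

    // After the actual bit y (0 or 1) is known:
    //   w_i += r * p * (1-p) * (y - p) * stretch(p_i)
    void update(const std::vector<double>& pin, double p, int y) {
        double g = r * p * (1.0 - p) * (y - p);
        for (std::size_t i = 0; i < w.size(); ++i) w[i] += g * stretch(pin[i]);
    }
};

For each bit, mix() gives p to the range coder, and update() adjusts
the weights once the actual bit is known.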
The neural network would probably replace the fixed weights now used to
average probabilities from the different mixer outputs in PAQAR. PAQAR
gets good compression because it combines a huge number of contexts and
models. I think these models could be computed in parallel to make the
speed reasonable.
-- Matt Mahoney