Просмотр сообщений

В этом разделе можно просмотреть все сообщения, сделанные этим пользователем.


Сообщения - zen.loop

Страницы: [1]
1
DjVu / Re: Программа DjVu Imager
« : 23 ДХТаРЫм 2013, 21:06:43 »
I'm sorry, I was just explaining my point. I think the tutorial is great!
With this line you wrote "I recommend you to play only with BSF and not to touch the Background quality (because the latter almost doesn't make sense)."
I think it's clear enough, as it is now.

2
DjVu / Re: Программа DjVu Imager
« : 23 ДХТаРЫм 2013, 20:59:36 »
monday2000:
насчет slices просто протупил - плохо читал мануал
его можно переделать - эти важные вещи просто не замечаются.
zen:
earlier you be advised to read djvu 3 spec
do it
or try and see the results of djvu imager work - it take a little time to learn what the best option.
0 - 35% quality in jpeg term, 50 - 60% quality.
BSF - downsampling similar to Acrobat PDF optimize

This is not for me, I was commenting on the tutorial for other people, for future new users. Just making a suggestion to improve the tutorial, in my opinion. You really think people should and will read 70 pages of a manual of a obscure program, just to learn how to do proper color scans? That's what the tutorial is for, getting a working knowledge all in one fast easy place. And that tutorial is very good already.
What I was saying, is that it would be nice to have a small one-line-or-two non-technical explanation or description of BSF and Background quality, and how they affect size and quality, which are important parameters. I was saying the "chunks explanation" might be a little too much, or instead just not useful because it would be insufficient.

3
DjVu / Re: Программа DjVu Imager
« : 23 ДХТаРЫм 2013, 17:46:21 »
Thanks. The BSF explanation is pretty clear, but the background quality is still not clear, I think.
I have no idea what the chunks, slices or c44.exe are.
Maybe it is explained in the manual, what I was hoping for is a very user oriented explanation.
What the user reading the tutorial will see is just the slider from 0 to 50. Approximately how does the size and quality increase, when I increase this?
Maybe then a reference to the manual for the rest of the explanation. The thing is, if I don't know what chunks and slices are, your explanation is not very useful.

This is just my personal experience from using the tutorial, thinking of an average user that has not much experience and just wants to learn the process for color scanning. Not really interested in the program inner workings.

If I don't check the BSF box, does it use the value 1 or 2?

4
DjVu / Re: Программа DjVu Imager
« : 22 ДХТаРЫм 2013, 14:54:03 »
Сделан английский вариант статьи "Использование DjVu Imager":

http://www.djvu-soft.narod.ru/scan/djvu_imager_en.htm

Статья, правда, значительно урезана по объёму - но, думаю, и так пока достаточно.

This is great, thanks a lot.

Could you clarify two things please?

-When I export from ST I don't get the files named like .sep.tiff, only .tiff, and it seems Djvu imager does not load the files without the .sep.tiff. I did this manually, and it worked.

- Maybe it would be better if you explained a little bit more about BSF and Background quality. What do they mean, and what happens exactly when you increase one or the other? What's the recommended value for Background quality? (just a couple of lines more).


Not related:
I was using ST "Enhanced" : http://sourceforge.net/projects/scantailor/files/scantailor-devel/enhanced/
mainly because of the Rectangular Mixed mode option.
Are you planning on having this feature on ST Featured?

5
DjVu / Re: djvu for tablets: ipad, kindle, nexus, surface, etc
« : 17 ДХТаРЫм 2013, 00:03:31 »
I have found a few. I start compiling them on the first post.
Contributions and opinions are welcome.
I have tried none of these.

6
DjVu / djvu for tablets: ipad, kindle, nexus, surface, etc
« : 15 ДХТаРЫм 2013, 19:23:28 »
What djvu applications exist for these tablets?

Say iPad, Kindle, Google Nexus, Microsoft Surface, Nook HD, Samsung Galaxy, Sony Xperia,...


iPad:

DjVu iPad Reader http://djvuipad.com/
DjVu Reader http://www.pasacreation.com/ https://itunes.apple.com/us/app/djvu-reader/id417854435?mt=8


Kindle:

Don't know of any.

Nexus:

EBookDroid https://play.google.com/store/apps/details?id=org.ebookdroid
VuViewer http://handheld.softpedia.com/get/Word-Processing-Text-Tools/Document-Viewer/VuViewer-118249.shtml

Surface:

WinDJView works on Windows 8, so this might work at least on Pro. Don't know about RT.

7
DjVu / Re: Full fledged color scanning with OCR
« : 15 ДХТаРЫм 2013, 19:12:39 »
Oh. I didn't notice there was a manual.. Sorry.

So, increasing subsample really does decrease quality. What we have to do is go to "Encode to DjVu(2)" and increase "Quality". In the manual it's says to choose it to 95.
And this does work well. I will try several of this combinations.

I might write an english tutorial on it in a couple of months. With several examples and combinations tabled and compared. Also thinking of improving the one existing and writing the one for djvusplit. So I was thinking of a tutorial for the 3 possibilities: b&w with few images (scan tailor+djvusmall), b&w text with lots of images (djvusplit), and full colour pages.

But the thing is I'm still not convinced djvu is the best for full color. Not that I know of an alternative, but it strikes me as odd that there is not a standard way known to do this, with good quality and small size. Comic books scans on the net are usually about 30MB per 100 pages, with great quality. But they are never djvu....

8
DjVu / Re: Full fledged color scanning with OCR
« : 15 ДХТаРЫм 2013, 15:35:30 »
zen.loop
Цитировать
This I don't understand what you mean. Could you be more specific, please? Which settings should I change?
I mean the program profiles. Try to experiment with them.

I was asking about the segmentation, and the text layers, I don't understand that. The encoding profiles I have been trying many.

9
DjVu / Re: Full fledged color scanning with OCR
« : 15 ДХТаРЫм 2013, 15:29:27 »
Цитировать
try djvusmall with Arcand 300 profile
step by step:
1. start DjvuSmall 0.4.4
2. drag and drop your tiff on program window
3. in LEFT LOWER corner present dropdown with label "Select encoding profile". Use it to set Arcand Scanned (300 dpi)
4. Press button "Convert"
5. If backgrond layer too blurry, press button "Options"
6. Press "Encode to Djvu"
7. In RIGHT LOWER corner check option "Bg subsample" & set it to 1 - obtain 1:1 background resolution. Try bigger value - size=quality/bg subsample
8. Close window by click on RED button in UPPER RIGHT corner
9. Press "Convert"
10. If do not satisfied with size - in step 7 try bigger value: size=quality/bg subsample

Thanks, NBell. I had tried this but I was not understanding what "Bg subsample" values mean. Because when I change from 1 to higher the quality actually diminishes. Changing from 1 to higher makes the file smaller and more blurry. So it seems to me the best option I the default Arcand300dpi, without changing the settings. But this is still a bit blurry for me.

Can you download this sample scan?
https://www.wetransfer.com/downloads/4dc85a692d2c73d6e6be01508fa18d7c20130215105716/de627e#
It's a raw color tiff scan at 300 dpi.
13MB

If I use directly djvusmall (with no scantailor before) on the raw file, then with:
Arcand 300 dpi , I get a 180KB file, which is not very bad quality, but still too blurry.
Photo600, I get a 200KB not very good.
Photo300 is exactly the same as Photo600.
Aggressive600 gives me a 167KB file, not very good.

If I use scantailor and output it at Photo600 I get a 50 MB tiff file, with great quality.
Then I used several options over this 50 MB file on djvu small:
Aggressive600 to get a 222KB file with no blurring, great.
Arcand300 to get a 298KB a bit blurred.
Photo600 to get a 315KB with no blurring, great, looks like the Aggressive600.

I'll probably try this now, when the raw tiff scan is 600dpi.
Anyway the results above show that there is not a very great difference in size, and Arcand always the more blurry.
Is there other settings you recommend me trying?
Maybe I'll go for the Aggresive600, after output with scantailor at Photo600.
As I said, I did not understand the Bg subsample, because it got worse and smaller when I increased from 1 to other values.



Цитировать
use FR11 DjVu Text Layer Crutch for FineReader 11.0.102.583 text layer transfer

I tried this, and found some small russian instructions on a forum (which I translated with google), but it gave me an error: it did not open the djvu file with ocr that FR11 saved. The interface is all in russian unfortunately.

10
DjVu / Re: Full fledged color scanning with OCR
« : 14 ДХТаРЫм 2013, 02:45:29 »
ABBYY FineReader the latest versions. Load your DjVu in it, OCR it, and generate the OCRed DjVu. Then use DjVuOCR 2 program to transpose the OCR layer from the ABBYY-made DjVu (which would be too big-sized) into original one.

Ah, I see. Indeed I had tried it with abbyy alone and it was not good.

You might experiment with the automatic segmention in DjVu Small - in order to auto-separate the white text into the text layer. In case of success it will cut the size of the final DjVu.

This I don't understand what you mean. Could you be more specific, please? Which settings should I change?

11
DjVu / Re: Full fledged color scanning with OCR
« : 13 ДХТаРЫм 2013, 02:33:24 »
People have been scanning color magazines and comic books for years, right?
What do they do? What's the standard quality process? And how large are the final files?

12
DjVu / Re: Full fledged color scanning with OCR
« : 12 ДХТаРЫм 2013, 23:37:58 »
Thank, monday2000 and NBell!

I did try to follow that tutorial for scantailor, but I couldn't do it.
But anyway what I'm really interested is when the whole page is an image, so no need for text layer and image layer, no need for split.

Can you see this image?
http://postimage.org/image/c1zcelzqz/
This is what I want to know how to do, with good quality and not huge files.

I asked the guy who scanned that file at another site, he told me he scanned it at 600 dpi tiff, then used photoshop to crop, edit and compress to jpeg (I don't know how exactly), and then use acrobat to produce the pdf and OCR. In the end it's about 1 MB per page. Which I think is still too big.

If I scan at 300 dpi tiff, and use scantailor at output of Photo600 and then djvusolo/small at Photo600, I get about the same size and quality too.
But if I do it at 300 dpi, it gets all blurry and watery.

And how do we do the OCR on top of that full color djvu, which program do we use? Is the final size the same?

Thanks a lot for the feedback!

13
DjVu / Re: Full fledged color scanning with OCR
« : 10 ДХТаРЫм 2013, 20:39:45 »
Thanks, I'll try it.

I have to say though, I'm not really understanding how to work with djvu imager. I tried it with ScanTailor and the tutorial here: http://www.djvu-soft.narod.ru/scan/djvu_imager.htm
but something's not working, I don't understand the workflow needed.

And I don't know what to do with the subsample on djvusmall. What does this do and mean? What values are should I put in? It's marked to "1" per default.

14
DjVu / Full fledged color scanning with OCR
« : 05 ДХТаРЫм 2013, 19:40:50 »
Hi,

I have already scanned several books using 300 dpi tiff scans -> scantailor -> djvusolo or djvusmall.
However now I want to scan books with many large color illustrations on every page, a background that is not white, colored text and I want to OCR it.

Can someone tell me what's the best way to do this?
For an A4 page with a full page color photo what size can I expect?
If I use a tiff scan at 300dpi about 20MB (600 dpi, about 60MB), a scantailor process at 300 dpi (600dpi) output and a djvusolo saving at Photo 300 dpi (600 dpi), I get a file of about 500 KB (about 3 MB).
The 600 dpi image quality is great.
But the 300dpi case has some areas a bit blurry after scantailor and djvusolo.

Can we do better with djvu imager for example?
Is there another way?
I have been discussing this on LibraryGenesis forum here: http://genofond.org/viewtopic.php?f=1&t=6733&start=20
People have told me about didjvu and img2djvu, but I have not yet tried them.

So imagine that, as an example, I want to scan and OCR these sample pages:
http://whatimg.com/i/0wlfh.jpg
http://whatimg.com/i/zjd2b.jpg
http://whatimg.com/i/1x8ej.jpg

Thanks!

Страницы: [1]