<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Mindhacks For Fingertips Follow-Up &#8211; Plus Earhacks</title>
	<atom:link href="http://crookedtimber.org/2009/10/25/mindhacks-for-fingertips-follow-up-plus-earhacks/feed/" rel="self" type="application/rss+xml" />
	<link>http://crookedtimber.org/2009/10/25/mindhacks-for-fingertips-follow-up-plus-earhacks/</link>
	<description>Out of the crooked timber of humanity, no straight thing was ever made</description>
	<lastBuildDate>Sun, 27 May 2012 09:28:09 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
	<item>
		<title>By: Ray Davis</title>
		<link>http://crookedtimber.org/2009/10/25/mindhacks-for-fingertips-follow-up-plus-earhacks/comment-page-1/#comment-293742</link>
		<dc:creator>Ray Davis</dc:creator>
		<pubDate>Mon, 02 Nov 2009 15:06:07 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=13477#comment-293742</guid>
		<description>I do a &lt;a href=&quot;http://www.pseudopodium.org/repress/index.html&quot; rel=&quot;nofollow&quot;&gt;fair amount of OCR-ing&lt;/a&gt;, and I invested in ABBYY FineReader many years and versions ago. One of its selling points is &quot;trainability,&quot; so I&#039;m pleased to hear about Readiris&#039;s support. For those of us drawn to older texts printed variably or in not-so-typical typefaces, that feature is well worth the extra tinkering. (You can see what happens without it in massive library trashings like the grant-funded &quot;The Million Illegible Books Project&quot; and the Google-funded whatchamacallit.)</description>
		<content:encoded><![CDATA[	<p>I do a <a href="http://www.pseudopodium.org/repress/index.html" rel="nofollow">fair amount of <span class="caps">OCR</span>-ing</a>, and I invested in <span class="caps">ABBYY </span>FineReader many years and versions ago. One of its selling points is &#8220;trainability,&#8221; so I&#8217;m pleased to hear about Readiris&#8217;s support. For those of us drawn to older texts printed variably or in not-so-typical typefaces, that feature is well worth the extra tinkering. (You can see what happens without it in massive library trashings like the grant-funded &#8220;The Million Illegible Books Project&#8221; and the Google-funded whatchamacallit.)</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Cian O'Connor</title>
		<link>http://crookedtimber.org/2009/10/25/mindhacks-for-fingertips-follow-up-plus-earhacks/comment-page-1/#comment-293376</link>
		<dc:creator>Cian O'Connor</dc:creator>
		<pubDate>Fri, 30 Oct 2009 15:20:24 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=13477#comment-293376</guid>
		<description>http://unpaper.berlios.de/
Is what I was thinking of. It will do a pretty good job of cleaning up PDF scans prior to OCR scans.
The following app has a version that works on windows, and probably the Mac as well:
http://www.mobileread.com/forums/showthread.php?t=21906</description>
		<content:encoded><![CDATA[	<p><a href="http://unpaper.berlios.de/" rel="nofollow">http://unpaper.berlios.de/</a><br />
Is what I was thinking of. It will do a pretty good job of cleaning up <span class="caps">PDF</span> scans prior to <span class="caps">OCR</span> scans.<br />
The following app has a version that works on windows, and probably the Mac as well:<br />
<a href="http://www.mobileread.com/forums/showthread.php?t=21906" rel="nofollow">http://www.mobileread.com/forums/showthread.php?t=21906</a></p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Cian O'Connor</title>
		<link>http://crookedtimber.org/2009/10/25/mindhacks-for-fingertips-follow-up-plus-earhacks/comment-page-1/#comment-293370</link>
		<dc:creator>Cian O'Connor</dc:creator>
		<pubDate>Fri, 30 Oct 2009 15:05:25 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=13477#comment-293370</guid>
		<description>Abbyy Finereader is supposed to be the best OCR system out there, though its PC only. I have version 9 (I think), which is excellent. Version 10 is supposed to be better. Handles OCR, and converts PDFs to HTML/Word -&gt; and is pretty reasonable with footnotes/diagrams/fonts. There&#039;s also a cut price version for 50-60$ which is supposed to be pretty decent for stuff that is just text, or not too complex layouts. The favourite of librarians and e-book pirates apparently.
Omni is pretty crap by all accounts, unless you&#039;re an expert tweaker.

There&#039;s some piece of freeware that I&#039;ve lost (mostly I search the pirate sites for scans, as its easier, than scanning in myself) which will clean up scans and so improve OCR. Its part of the Linux OCR suite that uses Tesseract (which is okay for free, but if time is money...), and is very useful.

Incidentally, a LOT of academic books are out there on filesharing sites these days. I&#039;m not sure what that&#039;s an indicator of.</description>
		<content:encoded><![CDATA[	<p>Abbyy Finereader is supposed to be the best <span class="caps">OCR</span> system out there, though its PC only. I have version 9 (I think), which is excellent. Version 10 is supposed to be better. Handles <span class="caps">OCR</span>, and converts PDFs to <span class="caps">HTML</span>/Word -> and is pretty reasonable with footnotes/diagrams/fonts. There&#8217;s also a cut price version for 50-60$ which is supposed to be pretty decent for stuff that is just text, or not too complex layouts. The favourite of librarians and e-book pirates apparently.<br />
Omni is pretty crap by all accounts, unless you&#8217;re an expert tweaker.</p>

	<p>There&#8217;s some piece of freeware that I&#8217;ve lost (mostly I search the pirate sites for scans, as its easier, than scanning in myself) which will clean up scans and so improve <span class="caps">OCR</span>. Its part of the Linux <span class="caps">OCR</span> suite that uses Tesseract (which is okay for free, but if time is money&#8230;), and is very useful.</p>

	<p>Incidentally, a <span class="caps">LOT</span> of academic books are out there on filesharing sites these days. I&#8217;m not sure what that&#8217;s an indicator of.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Andrew Brown</title>
		<link>http://crookedtimber.org/2009/10/25/mindhacks-for-fingertips-follow-up-plus-earhacks/comment-page-1/#comment-293272</link>
		<dc:creator>Andrew Brown</dc:creator>
		<pubDate>Thu, 29 Oct 2009 19:09:04 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=13477#comment-293272</guid>
		<description>Abbyy Finereader is pretty competent at all the things you want. PC only, so far as I know, but very quick, multilingual, and dead accurate on high quality scans. I have version 7, and have never felt any need to upgrade. So I don&#039;t know if it has been bloated all to hell since then.</description>
		<content:encoded><![CDATA[	<p>Abbyy Finereader is pretty competent at all the things you want. PC only, so far as I know, but very quick, multilingual, and dead accurate on high quality scans. I have version 7, and have never felt any need to upgrade. So I don&#8217;t know if it has been bloated all to hell since then.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Salient</title>
		<link>http://crookedtimber.org/2009/10/25/mindhacks-for-fingertips-follow-up-plus-earhacks/comment-page-1/#comment-293177</link>
		<dc:creator>Salient</dc:creator>
		<pubDate>Wed, 28 Oct 2009 12:07:53 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=13477#comment-293177</guid>
		<description>I have become a bit confused. If anyone wishes to help orient me, it&#039;s much appreciated:

* Which of these programs is good at recognizing when symbols don&#039;t make sense to it (e.g. an equation) and storing the resultant information in a picture, and just including that picture in the resultant file?

* Which of these programs does things like recognize word frequency in order to improve its ability to OCR correctly in ambiguous cases? (Or, alternatively, can be trained to prefer certain words over others.)

* Which of these programs can be &quot;trained&quot; to recognize new symbols and substitute them, for example seeing ║ and OCR&#039;ing it as \norm, or seeing β and OCR&#039;ing it as \beta ?

* Which of these programs can be trained to recognize formatting data and encode that: for example, scan &quot;what _dreams_ may come&quot; and OCR this as &quot;what \begin{boldface}dreams\end{boldface} may come&quot; ?

* Which of these programs can be trained to recognize relative position, for example notice e^x^ and OCR this as e\superscript{x} ?

I am planning to achieve world domination by developing an open-source OCR tool that scans old math texts and converts them to .TeX files which, when compiled, faithfully recover the original text and formatting, but as I currently have exactly none of the skill set necessary to accomplish this, I&#039;d rather get the next closest OCR program created by someone else and tinker with it a bit.

Thanks in advance to anyone who responds!</description>
		<content:encoded><![CDATA[	<p>I have become a bit confused. If anyone wishes to help orient me, it&#8217;s much appreciated:</p>

	<ul>
		<li>Which of these programs is good at recognizing when symbols don&#8217;t make sense to it (e.g. an equation) and storing the resultant information in a picture, and just including that picture in the resultant file?</li>
	</ul>

	<ul>
		<li>Which of these programs does things like recognize word frequency in order to improve its ability to <span class="caps">OCR</span> correctly in ambiguous cases? (Or, alternatively, can be trained to prefer certain words over others.)</li>
	</ul>

	<ul>
		<li>Which of these programs can be &#8220;trained&#8221; to recognize new symbols and substitute them, for example seeing ║ and <span class="caps">OCR</span>&#8217;ing it as norm, or seeing &#946; and <span class="caps">OCR</span>&#8217;ing it as beta ?</li>
	</ul>

	<ul>
		<li>Which of these programs can be trained to recognize formatting data and encode that: for example, scan &#8220;what <em>dreams</em> may come&#8221; and <span class="caps">OCR</span> this as &#8220;what begin{boldface}dreamsend{boldface} may come&#8221; ?</li>
	</ul>

	<ul>
		<li>Which of these programs can be trained to recognize relative position, for example notice e<sup>x</sup> and <span class="caps">OCR</span> this as esuperscript{x} ?</li>
	</ul>

	<p>I am planning to achieve world domination by developing an open-source <span class="caps">OCR</span> tool that scans old math texts and converts them to .TeX files which, when compiled, faithfully recover the original text and formatting, but as I currently have exactly none of the skill set necessary to accomplish this, I&#8217;d rather get the next closest <span class="caps">OCR</span> program created by someone else and tinker with it a bit.</p>

	<p>Thanks in advance to anyone who responds!</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Brendan</title>
		<link>http://crookedtimber.org/2009/10/25/mindhacks-for-fingertips-follow-up-plus-earhacks/comment-page-1/#comment-293169</link>
		<dc:creator>Brendan</dc:creator>
		<pubDate>Wed, 28 Oct 2009 04:56:09 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=13477#comment-293169</guid>
		<description>Probably not of much use or interest to people here, but Acrobat&#039;s OCR for Chinese texts is surprisingly good. It still requires correcting (particularly for texts that mix traditional and simplified characters, and anything featuring Chinese characters plus, e.g., transliterated Sanskrit is going to be a world of pain), but is much, much better than the other options (all...one of them?) for Mac OCR of Chinese. Thanks for the heads-up -- hadn&#039;t known about this before.</description>
		<content:encoded><![CDATA[	<p>Probably not of much use or interest to people here, but Acrobat&#8217;s <span class="caps">OCR</span> for Chinese texts is surprisingly good. It still requires correcting (particularly for texts that mix traditional and simplified characters, and anything featuring Chinese characters plus, e.g., transliterated Sanskrit is going to be a world of pain), but is much, much better than the other options (all&#8230;one of them?) for Mac <span class="caps">OCR</span> of Chinese. Thanks for the heads-up&#8212;hadn&#8217;t known about this before.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: jre</title>
		<link>http://crookedtimber.org/2009/10/25/mindhacks-for-fingertips-follow-up-plus-earhacks/comment-page-1/#comment-293147</link>
		<dc:creator>jre</dc:creator>
		<pubDate>Tue, 27 Oct 2009 21:24:31 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=13477#comment-293147</guid>
		<description>Tesseract, or &quot;tesseract-ocr&quot;, is free (in both senses), works on all platforms and does not have a problem with grayscale or binary images.  I have used it to scan multi-generation copies of wretched quality and gotten good results.  Downside: you need to scan to a tiff or convert a pdf file to tiff.</description>
		<content:encoded><![CDATA[	<p>Tesseract, or &#8220;tesseract-ocr&#8221;, is free (in both senses), works on all platforms and does not have a problem with grayscale or binary images.  I have used it to scan multi-generation copies of wretched quality and gotten good results.  Downside: you need to scan to a tiff or convert a pdf file to tiff.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Jon H</title>
		<link>http://crookedtimber.org/2009/10/25/mindhacks-for-fingertips-follow-up-plus-earhacks/comment-page-1/#comment-293052</link>
		<dc:creator>Jon H</dc:creator>
		<pubDate>Mon, 26 Oct 2009 23:17:23 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=13477#comment-293052</guid>
		<description>&quot;Everyone with Acrobat should know this: there’s an OCR option under the document menu. (No great shakes, but it works.)&quot;

With Acrobat, I found it best to use the option to retain the bitmap text, rather than to replace the bitmap with the OCR&#039;d text. The text is searchable, and copyable, but the original font remains, and the OCR mistakes are hidden. (Which could be a problem, but might not be.)</description>
		<content:encoded><![CDATA[	<p>&#8220;Everyone with Acrobat should know this: there&#8217;s an <span class="caps">OCR</span> option under the document menu. (No great shakes, but it works.)&#8221;</p>

	<p>With Acrobat, I found it best to use the option to retain the bitmap text, rather than to replace the bitmap with the <span class="caps">OCR</span>&#8217;d text. The text is searchable, and copyable, but the original font remains, and the <span class="caps">OCR</span> mistakes are hidden. (Which could be a problem, but might not be.)</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Zora</title>
		<link>http://crookedtimber.org/2009/10/25/mindhacks-for-fingertips-follow-up-plus-earhacks/comment-page-1/#comment-293029</link>
		<dc:creator>Zora</dc:creator>
		<pubDate>Mon, 26 Oct 2009 18:16:27 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=13477#comment-293029</guid>
		<description>A heavy-duty guillotine gets rid of the covers and the spine. A sheet-fed scanner (expensive!) can scan a whole book, both sides of the pages, in less than half an hour. Then OCR the whole batch.

The tools are somewhat expensive, but once the investment is made, conversion from deadtree to e is fast ... assuming a recent printing with clear, crisp print.</description>
		<content:encoded><![CDATA[	<p>A heavy-duty guillotine gets rid of the covers and the spine. A sheet-fed scanner (expensive!) can scan a whole book, both sides of the pages, in less than half an hour. Then <span class="caps">OCR</span> the whole batch.</p>

	<p>The tools are somewhat expensive, but once the investment is made, conversion from deadtree to e is fast &#8230; assuming a recent printing with clear, crisp print.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: John Holbo</title>
		<link>http://crookedtimber.org/2009/10/25/mindhacks-for-fingertips-follow-up-plus-earhacks/comment-page-1/#comment-293021</link>
		<dc:creator>John Holbo</dc:creator>
		<pubDate>Mon, 26 Oct 2009 16:53:50 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=13477#comment-293021</guid>
		<description>&quot;Any tips on turning a bound book into scannable pages?&quot;

Be prepared to break its spine. As to sheet feeding: I&#039;m sure you don&#039;t want to be feeding actual book pages through one of those, even if the whole spine has dissolved and you just have a pile of paper. You could photocopy the whole, fairly quickly, then sheetfeed the photocopies, more slowly. That might produce perfectly good results. I have access to a really high quality flatbed scanner at work and a less high quality flatbed at home. I think flatbed is probably the way to go with old books.</description>
		<content:encoded><![CDATA[	<p>&#8220;Any tips on turning a bound book into scannable pages?&#8221;</p>

	<p>Be prepared to break its spine. As to sheet feeding: I&#8217;m sure you don&#8217;t want to be feeding actual book pages through one of those, even if the whole spine has dissolved and you just have a pile of paper. You could photocopy the whole, fairly quickly, then sheetfeed the photocopies, more slowly. That might produce perfectly good results. I have access to a really high quality flatbed scanner at work and a less high quality flatbed at home. I think flatbed is probably the way to go with old books.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Jon H</title>
		<link>http://crookedtimber.org/2009/10/25/mindhacks-for-fingertips-follow-up-plus-earhacks/comment-page-1/#comment-293017</link>
		<dc:creator>Jon H</dc:creator>
		<pubDate>Mon, 26 Oct 2009 16:34:13 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=13477#comment-293017</guid>
		<description>Any tips on turning a bound book into scannable pages?

I picked up an old scanner with a sheet feeder at a &#039;freecycling&#039; event here at work. Had to buy a semi-compatible power supply, and need to solder it in, but it works, more or less.

Some of my books are starting to turn yellow, so retaining the physical objects isn&#039;t a strong motivator.</description>
		<content:encoded><![CDATA[	<p>Any tips on turning a bound book into scannable pages?</p>

	<p>I picked up an old scanner with a sheet feeder at a &#8216;freecycling&#8217; event here at work. Had to buy a semi-compatible power supply, and need to solder it in, but it works, more or less.</p>

	<p>Some of my books are starting to turn yellow, so retaining the physical objects isn&#8217;t a strong motivator.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: jholbo</title>
		<link>http://crookedtimber.org/2009/10/25/mindhacks-for-fingertips-follow-up-plus-earhacks/comment-page-1/#comment-292992</link>
		<dc:creator>jholbo</dc:creator>
		<pubDate>Mon, 26 Oct 2009 12:42:28 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=13477#comment-292992</guid>
		<description>Hmmm, this isn&#039;t good. Quiggin sent me his PDF for conversion and Readiris couldn&#039;t read it. Total blank It&#039;s the black&amp;white problem I mentioned above, I presume. I have successfully converted lots of PDF&#039;s I&#039;ve found here and there, and I was starting to think that as long as I personally didn&#039;t select b&amp;w while scanning, that other PDF&#039;s floating around would be no problem. But an OCR app that draws a complete blank with any degree of frequency is a serious problem. Not good. (Since 90% of the stuff I want to convert has been scanned by me, it&#039;s not a problem for me. But for other people? Not so good, I expect.)

Fortunately, Acrobat did a pretty serviceable job on the thing Quiggin sent me. Everyone with Acrobat should know this: there&#039;s an OCR option under the document menu. (No great shakes, but it works.)</description>
		<content:encoded><![CDATA[	<p>Hmmm, this isn&#8217;t good. Quiggin sent me his <span class="caps">PDF</span> for conversion and Readiris couldn&#8217;t read it. Total blank It&#8217;s the black&#038;white problem I mentioned above, I presume. I have successfully converted lots of <span class="caps">PDF</span>&#8217;s I&#8217;ve found here and there, and I was starting to think that as long as I personally didn&#8217;t select b&#038;w while scanning, that other <span class="caps">PDF</span>&#8217;s floating around would be no problem. But an <span class="caps">OCR</span> app that draws a complete blank with any degree of frequency is a serious problem. Not good. (Since 90% of the stuff I want to convert has been scanned by me, it&#8217;s not a problem for me. But for other people? Not so good, I expect.)</p>

	<p>Fortunately, Acrobat did a pretty serviceable job on the thing Quiggin sent me. Everyone with Acrobat should know this: there&#8217;s an <span class="caps">OCR</span> option under the document menu. (No great shakes, but it works.)</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: John Holbo</title>
		<link>http://crookedtimber.org/2009/10/25/mindhacks-for-fingertips-follow-up-plus-earhacks/comment-page-1/#comment-292964</link>
		<dc:creator>John Holbo</dc:creator>
		<pubDate>Mon, 26 Oct 2009 00:38:09 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=13477#comment-292964</guid>
		<description>I have also noticed one quirk about Readiris, which I discovered early enough that it&#039;s not going to bother me, but it&#039;s worth mentioning. For test purposes, I scanned some docs in b&amp;w and grayscale. I figured the former would be just as good, and might even be better. And scanning in b&amp;w is slightly faster. To my surprise, the b&amp;w scans didn&#039;t show up at all. They looked fine in any PDF reader. But Readiris couldn&#039;t read them. Since OCR starts by converting grayscale to b&amp;w (I believe), this is sort of an odd bug. I emailed them and a support person asked for a sample problematic file. I haven&#039;t heard back since then (that was a week ago.) So I recommend: scanning in grayscale, or at least testing any b&amp;w setting before using. I haven&#039;t had any trouble with old PDF&#039;s that I didn&#039;t make, but I think those are mostly grayscale, in effect.</description>
		<content:encoded><![CDATA[	<p>I have also noticed one quirk about Readiris, which I discovered early enough that it&#8217;s not going to bother me, but it&#8217;s worth mentioning. For test purposes, I scanned some docs in b&#038;w and grayscale. I figured the former would be just as good, and might even be better. And scanning in b&#038;w is slightly faster. To my surprise, the b&#038;w scans didn&#8217;t show up at all. They looked fine in any <span class="caps">PDF</span> reader. But Readiris couldn&#8217;t read them. Since <span class="caps">OCR</span> starts by converting grayscale to b&#038;w (I believe), this is sort of an odd bug. I emailed them and a support person asked for a sample problematic file. I haven&#8217;t heard back since then (that was a week ago.) So I recommend: scanning in grayscale, or at least testing any b&#038;w setting before using. I haven&#8217;t had any trouble with old <span class="caps">PDF</span>&#8217;s that I didn&#8217;t make, but I think those are mostly grayscale, in effect.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: John Holbo</title>
		<link>http://crookedtimber.org/2009/10/25/mindhacks-for-fingertips-follow-up-plus-earhacks/comment-page-1/#comment-292963</link>
		<dc:creator>John Holbo</dc:creator>
		<pubDate>Mon, 26 Oct 2009 00:23:12 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=13477#comment-292963</guid>
		<description>Hmmm, I have Acrobat with the CS Suite. Is there anyway to get OCR in CS Suite without just going into Acrobat and doing the quick OCR thing. Because that works OK, and may be fine for many people&#039;s purposes. But Readiris works a lot better. I considered going for Finereader instead - and there is a free trial version you can download, I believe. But it&#039;s $200, so I went for the slightly cheaper one. And I&#039;m very happy.

John Quiggin: Send me that PDF you want converted. I&#039;ll send you the RTF conversion that is produced with 1-click. And you see whether you like the results.</description>
		<content:encoded><![CDATA[	<p>Hmmm, I have Acrobat with the <span class="caps">CS </span>Suite. Is there anyway to get <span class="caps">OCR</span> in <span class="caps">CS </span>Suite without just going into Acrobat and doing the quick <span class="caps">OCR</span> thing. Because that works OK, and may be fine for many people&#8217;s purposes. But Readiris works a lot better. I considered going for Finereader instead &#8211; and there is a free trial version you can download, I believe. But it&#8217;s $200, so I went for the slightly cheaper one. And I&#8217;m very happy.</p>

	<p>John Quiggin: Send me that <span class="caps">PDF</span> you want converted. I&#8217;ll send you the <span class="caps">RTF</span> conversion that is produced with 1-click. And you see whether you like the results.</p>
 ]]></content:encoded>
	</item>
	<item>
		<title>By: Zora</title>
		<link>http://crookedtimber.org/2009/10/25/mindhacks-for-fingertips-follow-up-plus-earhacks/comment-page-1/#comment-292961</link>
		<dc:creator>Zora</dc:creator>
		<pubDate>Sun, 25 Oct 2009 23:02:37 +0000</pubDate>
		<guid isPermaLink="false">http://crookedtimber.org/?p=13477#comment-292961</guid>
		<description>The folks at Distributed Proofreaders (16,000 books scanned or harvested) are fond of Abbyy Finereader.</description>
		<content:encoded><![CDATA[	<p>The folks at Distributed Proofreaders (16,000 books scanned or harvested) are fond of Abbyy Finereader.</p>
 ]]></content:encoded>
	</item>
</channel>
</rss>

