[Hidden-tech] Problem with OCR from Adobe PDF
Rich
rich at on-the-net.com
Tue Mar 2 09:03:08 EST 2010
That is odd. I have two HP printer/scanners, and both came with the
ScanPro software, which includes OCR.
You might want to check the box, or check with HP.
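
For the scan-to-text route described in the quoted message further down, plain-text OCR output usually has a hard line break at the end of every scanned line. The following is a minimal sketch, not part of the original workflow, of how that text could be reflowed into continuous paragraphs before opening it in Word. The function name reflow_ocr_text is hypothetical, and the hyphen rule is only a heuristic that can wrongly merge genuinely hyphenated words.

```python
import re


def reflow_ocr_text(raw: str) -> str:
    """Join hard-wrapped OCR lines into paragraphs.

    Assumption: blank lines in the OCR output mark paragraph breaks.
    """
    # Normalize Windows and old Mac line endings to "\n".
    text = raw.replace("\r\n", "\n").replace("\r", "\n")

    paragraphs = []
    # One or more blank lines separate paragraphs.
    for block in re.split(r"\n\s*\n", text):
        lines = [ln.strip() for ln in block.split("\n") if ln.strip()]
        if not lines:
            continue
        joined = lines[0]
        for ln in lines[1:]:
            if joined.endswith("-") and ln[:1].islower():
                # Heuristic: a trailing hyphen followed by a lowercase
                # letter is treated as a word split across two lines.
                joined = joined[:-1] + ln
            else:
                joined += " " + ln
        paragraphs.append(joined)

    return "\n\n".join(paragraphs)


if __name__ == "__main__":
    sample = "This is a scanned docu-\nment with hard\nline breaks.\n\nSecond paragraph."
    print(reflow_ocr_text(sample))
```

Saving the result as a .txt file and opening it through File/Open in Word should then give one continuous flow of text per paragraph.
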
On 3/2/2010 9:00 AM, George Forman wrote:
> Sorry, I meant to add that the only OCR (image-to-text) functionality I
> have is within Acrobat. I have no standalone OCR software, at least none
> that I can launch, even though HP ScanPro says that this functionality
> comes with the printer.
>
> I presume I will invest in OCR software soon. Thanks,
> George
>
> On Mar 2, 2010, at 8:43 AM, Rich wrote:
>
>> I have been scanning a number of my older papers (in all sorts of
>> formats, from simple typed pages to book chapters and newspaper
>> articles) and have generally found that scanning directly to Word
>> documents is very often a bust.
>>
>> The OCR engine tries to reproduce what it sees as sections, using all
>> sorts of Word sections and format changes, when it is really just
>> confused by visual imperfections in the document.
>>
>> What I ended up doing is scanning to plain text and then importing
>> that into Word, so you start with much cleaner text. Just use the
>> File/Open menu to open the text document in Word, then use 'Save As'
>> with the format set to 'Word document'.
>>
>> Rich
>>
>> On 3/2/2010 7:31 AM, George Forman wrote:
>>>
>>> Good Morning Hidden Techies,
>>>
>>> My question deals with my attempts to scan a text document and then
>>> convert the scan to editable text. I have created a .doc file but
>>> the text is stuck in blocks that make it impossible to move text
>>> from one block to another or to have the text build as one
>>> continuous flow. I wonder if there is some command I can use to
>>> "flatten" the text document in Microsoft Word, something that
>>> eliminates these funny constraints. Here is what I have done.
>>>
>>> 1. Scan the page using my HP All in One printer, creating a PDF file
>>> directly from the scan.
>>> 2. Open the PDF in Acrobat Professional
>>> 3. Use the OCR feature in Acrobat to convert the print to editable text
>>> 4. Export as a .doc file.
>>>
>>> I work on a Mac using Snow Leopard as the operating system.
>>>
>>> Any ideas? My gratitude in advance, George
>>>
>>>
>>> George Forman, Ph.D.
>>> Emeritus Professor, UMass, Amherst
>>> President, Videatives, Inc.
>>> 19 The Hollow
>>> Amherst, Massachusetts
>>> 01002
>>> Phone: 413 256 8846
>>> Fax: 413 230 3130
>>> www.videatives.com <http://www.videatives.com>
>>> /See What Children Know™/
>>>
>>
>> --
>> Rich Roth
>> CEO On-the-net
>>
>> Bringing you complex online systems since the net was young
>> http://www.tnrglobal.com - Blog:http://www.rizbang.com
>> Helping move the world:http://www.earththrives.com
>>
>>
>