[Hidden-tech] Problem with OCR from Adobe PDF

Rich rich at on-the-net.com
Tue Mar 2 09:03:08 EST 2010


That is odd. I have two HP printer/scanners, and both came with the 
ScanPro software with OCR.
You might want to check the box, or check with HP.

On 3/2/2010 9:00 AM, George Forman wrote:
> Sorry, I meant to add that the only image-to-text (OCR) functionality I 
> have is within Adobe.  I have no standalone OCR software, at least none 
> that I can launch, even though HP ScanPro says that this functionality 
> comes with the printer.
>
> I presume I will invest in OCR software soon.  Thanks,
> George
>
> On Mar 2, 2010, at 8:43 AM, Rich wrote:
>
>> I have been scanning a number of my older papers (in all sorts of 
>> formats, from simple typed pages to book chapters and newspaper 
>> articles) and have generally found that scanning to Word documents is 
>> very often a bust.
>>
>> The OCR engine tries to reproduce what it sees as separate sections 
>> using all sorts of Word sections and format changes, when it is 
>> really just confused by visual imperfections in the document.
>>
>> What I ended up doing is scanning to plain text and then importing 
>> into Word, so you start with much cleaner text.
>> Just use the File/Open menu to open the text document in Word, then 
>> use 'Save As' with the format set to 'Word' document.
>>
>> Rich
>>
>> On 3/2/2010 7:31 AM, George Forman wrote:
>>>
>>> Good Morning Hidden Techies,
>>>
>>> My question deals with my attempts to scan a text document and then 
>>> convert the scan to editable text.  I have created a .doc file but 
>>> the text is stuck in blocks that make it impossible to move text 
>>> from one block to another or to have the text build as one 
>>> continuous flow.  I wonder if there is some command I can use to 
>>> "flatten" the text document in Microsoft Word, something that 
>>> eliminates these funny constraints.  Here is what I have done.
>>>
>>> 1. Scan the page using my HP All in One printer, creating a PDF file 
>>> directly from the scan.
>>> 2. Open the PDF in Acrobat Professional
>>> 3. Use the OCR feature in Acrobat to convert the print to editable text
>>> 4. Export as a .doc file.
>>>
>>> I work on a Mac using Snow Leopard as the operating system.
>>>
>>> Any ideas?  My gratitude in advance,  George
>>>
>>>
>>> George Forman, Ph.D.
>>> Emeritus Professor, UMass, Amherst
>>> President, Videatives, Inc.
>>> 19 The Hollow
>>> Amherst, Massachusetts
>>> 01002
>>> Phone: 413 256 8846
>>> Fax: 413 230 3130
>>> www.videatives.com <http://www.videatives.com>
>>> /See What Children Know™/
>>>
>>>
>>>
>>>
>>> _______________________________________________
>>> Hidden-discuss mailing list - home page:http://www.hidden-tech.net
>>> Hidden-discuss at lists.hidden-tech.net
>>>
>>> You are receiving this because you are on the Hidden-Tech Discussion list.
>>> If you would like to change your list preferences, Go to the Members
>>> page on the Hidden Tech Web site.
>>> http://www.hidden-tech.net/members
>>
>> -- 
>> Rich Roth
>> CEO On-the-net
>>
>> Bringing you complex online systems since the net was young
>> http://www.tnrglobal.com  - Blog:http://www.rizbang.com
>> Helping move the world:http://www.earththrives.com
>>
>>      
>
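
For the scan-to-text route quoted above, the text file usually still has a hard line break at the end of every scanned line. A minimal sketch of one way to tidy that up before opening the file in Word, assuming the OCR output is plain text with blank lines between paragraphs (the function name `reflow_ocr_text` is made up for illustration and is not part of ScanPro, Acrobat, or Word):

```python
import re


def reflow_ocr_text(raw: str) -> str:
    """Join hard-wrapped OCR lines so each paragraph becomes one line."""
    # Normalize Windows and old Mac line endings to "\n".
    text = raw.replace("\r\n", "\n").replace("\r", "\n")

    # Rejoin words split by a hyphen at a line break ("docu-\nment").
    # Caveat: this also joins real hyphenated compounds that happen to
    # break at the end of a line (e.g. "well-\nknown" -> "wellknown").
    text = re.sub(r"(\w)-\n(\w)", r"\1\2", text)

    # Treat one or more blank lines as a paragraph separator.
    paragraphs = re.split(r"\n\s*\n", text)

    # Collapse every run of whitespace inside a paragraph to one space.
    cleaned = [" ".join(p.split()) for p in paragraphs]

    # Drop empty paragraphs and separate the rest with one blank line.
    return "\n\n".join(p for p in cleaned if p)
```

The result can be saved back to a .txt file and opened through File/Open in Word as described above, which should give text that flows continuously instead of sitting in separate blocks.
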

-- 
Rich Roth
CEO On-the-net

Bringing you complex online systems since the net was young
http://www.tnrglobal.com - Blog: http://www.rizbang.com
Helping move the world:  http://www.earththrives.com

