<html>
  <head>
    <meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
  </head>
  <body>
    <p>funny you should ask - as I am just finishing digitizing 40 years
      of journals from Watervliet (NY)Shaker Village and have digitized
      a large number of resources<br>
      as part of Shakerpedia and other projects.<br>
      <br>
      SO this is not just a pick software question -- it's more about
      the overall project design:<br>
         Just a few parts:<br>
              most higher level scanners include good OCR systems --
      ABBYY is included with many PC/MAC systems<br>
              Are the journals sheet feed-able  or can they be even if
      cutting the bindings.<br>
              Or even better, has someone else digitized or will help
      digitize - just are Archive.org that has both a major archive and
      infrastructure for digitizing.<br>
              Is there the staff to handle this or what cost has to be
      covered.<br>
              What is the most effective platform, there are reason to
      get into linux systems - such as Tesseract <br>
               Once it's digitized, how will it be search -- the most
      common system for such online use is Elasticsearch, which you can
      run on AWS or almost any cloud platform.<br>
    </p>
    <p>As you can tell - there is a lot more to that question than just
      software -- there are few comments above - if you want to discuss
      this more, email off-list</p>
    <p>Stay well - Rich<br>
    </p>
    <div class="moz-cite-prefix">On 5/5/2021 3:53 PM, Joanna Campe via
      Hidden-discuss wrote:<br>
    </div>
    <blockquote type="cite"
cite="mid:CACW46y_neyPg=zhPAR0g8YYODMFEcMj=_VzosCkpmQJQ64ML4A@mail.gmail.com">
      <meta http-equiv="content-type" content="text/html; charset=UTF-8">
      <div dir="ltr">
        <div class="gmail_default"
          style="font-family:garamond,serif;font-size:large">Hi
          everyone,<br>
          <br>
          I hope you are all safe and well.<br>
          <br>
          We would like to digitize our archival hardcopy magazines, and
          we are looking for the best option. Does anyone have
          experience with this and can make a recommendation for OCR
          software? </div>
        <div class="gmail_default"
          style="font-family:garamond,serif;font-size:large"><br>
        </div>
        <div class="gmail_default"
          style="font-family:garamond,serif;font-size:large">We have
          tried Adobe Acrobat Pro and a couple others, but are having
          some difficulty recognizing text that is printed over images.</div>
        <div class="gmail_default"
          style="font-family:garamond,serif;font-size:large"><br>
        </div>
        <div class="gmail_default"
          style="font-family:garamond,serif;font-size:large">Important
          features are searchable PDF creation in a magazine format. We
          are using an Epson Perfection V500 Plus scanner, if that
          matters.</div>
        <div class="gmail_default"
          style="font-family:garamond,serif;font-size:large"><br>
        </div>
        <div class="gmail_default"
          style="font-family:garamond,serif;font-size:large">Your
          recommendations are much appreciated!<br>
          <br>
          My best,<br>
          <br>
          Joanna</div>
        <div class="gmail_default"
          style="font-family:garamond,serif;font-size:large"><br>
        </div>
        <div>
          <div dir="ltr" class="gmail_signature"
            data-smartmail="gmail_signature">
            <div dir="ltr">
              <div>
                <div dir="ltr">
                  <div>
                    <div dir="ltr">
                      <div dir="ltr">
                        <div dir="ltr">
                          <div dir="ltr"><font size="4" face="garamond,
                              serif">Joanna Campe<br>
                              Executive Director<br>
                              Remineralize the Earth<br>
                              152 South Street<br>
                              Northampton, MA 01060 USA </font></div>
                          <div dir="ltr"><br>
                          </div>
                          <div dir="ltr"><font size="4" face="garamond,
                              serif">Tel: 413-563-9938</font>
                            <div><font size="4" face="garamond, serif">Email: </font><a
                                href="mailto:jcampe@remineralize.org"
                                style="font-family:Times;font-size:18px"
                                target="_blank" moz-do-not-send="true">jcampe@remineralize.org</a><font
                                size="4" face="garamond, serif"><br>
                              </font><font size="4" face="garamond,
                                serif"><a
                                  href="http://www.remineralize.org/"
                                  target="_blank" moz-do-not-send="true">http://www.remineralize.org</a>
                                <div
                                  style="display:inline-block;width:16px;height:16px"> </div>
                              </font></div>
                            <div><font size="4" face="garamond, serif">
                                <div
                                  style="display:inline-block;width:16px;height:16px"><br>
                                </div>
                              </font></div>
                            <div><font size="4" face="garamond, serif"
                                color="#000000"><b>Book</b></font></div>
                            <div><font size="4" face="garamond, serif"
                                color="#000000">Geotherapy: Innovative
                                Methods of Soil Fertility Restoration,
                                Carbon Sequestration, and Reversing CO2
                                Increase</font></div>
                            <div><font size="4" face="garamond, serif"><a
href="http://www.crcpress.com/product/isbn/9781466595392"
                                  style="color:rgb(17,85,204)"
                                  target="_blank" moz-do-not-send="true">http://www.crcpress.com/product/isbn/9781466595392</a> </font></div>
                            <div><font size="4" face="garamond, serif"><br>
                              </font></div>
                            <div><font size="4" face="garamond, serif">Please
                                join and support us on <b><a
                                    href="https://www.patreon.com/RTE"
                                    target="_blank"
                                    moz-do-not-send="true">Patr</a></b></font><b><a
                                  href="https://www.patreon.com/RTE"
                                  target="_blank" moz-do-not-send="true"><span
style="font-family:garamond,serif;font-size:large">e</span><span
                                    style="font-family:garamond,serif;font-size:large">on</span></a></b></div>
                            <div><font size="4" face="garamond, times
                                new roman, serif"><a
                                  href="https://www.patreon.com/RTE"
                                  target="_blank" moz-do-not-send="true">https://www.patreon.com/RTE</a></font><font
                                size="4" face="garamond, serif"><br>
                              </font></div>
                          </div>
                        </div>
                      </div>
                    </div>
                  </div>
                </div>
              </div>
            </div>
          </div>
        </div>
      </div>
      <br>
      <fieldset class="mimeAttachmentHeader"></fieldset>
      <pre class="moz-quote-pre" wrap="">_______________________________________________
Hidden-discuss mailing list - home page: <a class="moz-txt-link-freetext" href="http://www.hidden-tech.net">http://www.hidden-tech.net</a>
<a class="moz-txt-link-abbreviated" href="mailto:Hidden-discuss@lists.hidden-tech.net">Hidden-discuss@lists.hidden-tech.net</a>

You are receiving this because you are on the Hidden-Tech Discussion list.
If you would like to change your list preferences, Go to the Members
page on the Hidden Tech Web site.
<a class="moz-txt-link-freetext" href="http://www.hidden-tech.net/members">http://www.hidden-tech.net/members</a>
</pre>
    </blockquote>
    <pre class="moz-signature" cols="72">-- 
Rich Roth
CEO TnR Global

Bio and personal blog: <a class="moz-txt-link-freetext" href="http://rizbang.com">http://rizbang.com</a>
Building the really big sites:      <a class="moz-txt-link-freetext" href="http://www.tnrglobal.com">http://www.tnrglobal.com</a>
Small/Soho business in the PV:        <a class="moz-txt-link-freetext" href="http://www.hidden-tech.net">http://www.hidden-tech.net</a>
Places to meet for business:        <a class="moz-txt-link-freetext" href="http://www.meetmewhere.com">http://www.meetmewhere.com</a>
And for Arts and relaxation:
<a class="moz-txt-link-freetext" href="http://TarotMuertos.com">http://TarotMuertos.com</a> - Artistic Tarot Deck
   <a class="moz-txt-link-freetext" href="http://www.welovemuseums.com">http://www.welovemuseums.com</a>
   <a class="moz-txt-link-freetext" href="http://www.artonmytv.com/">http://www.artonmytv.com/</a>
Helping move the world:             <a class="moz-txt-link-freetext" href="http://www.earththrives.com">http://www.earththrives.com</a></pre>
  </body>
</html>