PDFs: Searching for Hyperlinks within PDF documents


  • PDFs: Searching for Hyperlinks within PDF documents

    Hi,
    Introduction:
    I'm using Zoom 5 Enterprise as the search engine for our HTML sites.
    Everything is working fine there.

    Question:
    I want to use Zoom to search within PDF documents and find the hyperlinks embedded in them.
    Reason: we're changing content management system (CMS) vendors, and our PDFs have embedded links to URLs that are generated by the current CMS.
    The new CMS generates new URLs, so those links will have to be updated.
    That means we have to look in every one of our 10,000 PDF files to find the embedded hyperlinks.

    Example:
    A PDF contains this sentence as part of the document (the hyperlink text is shown in capitals):
    This is a link to another document. CLICK HERE TO BRING UP ANOTHER DOCUMENT.

    The phrase
    CLICK HERE TO BRING UP ANOTHER DOCUMENT.

    has an embedded hyperlink underneath it, of this format:
    <a href="CLICK HERE TO BRING UP ANOTHER DOCUMENT" >http://www.shpdata.com/Public/index.asp</a>


    So, I can't search for http:// because that isn't part of the visible text in the PDF.
    Instead, I need to search the embedded anchor tags "underneath" the text.

    Can I use Zoom 5.0 to find these embedded anchor tag hyperlinks within a PDF document?

    Thank You.
    Bob Hangsterfer

  • #2
    Originally posted by BobHank:
    The phrase
    CLICK HERE TO BRING UP ANOTHER DOCUMENT.

    has an embedded hyperlink underneath it, of this format:
    <a href="CLICK HERE TO BRING UP ANOTHER DOCUMENT" >http://www.shpdata.com/Public/index.asp</a>
    I'm guessing you probably meant:

    <a href="http://www.shpdata.com/Public/index.asp" >CLICK HERE TO BRING UP ANOTHER DOCUMENT</a>
    Zoom does not provide any means to search for link URLs. Nor does the PDF plugin have the ability to retrieve links from PDFs. So, in short, the answer is no.
    --Ray
    Wrensoft Web Software
    Sydney, Australia
    Zoom Search Engine
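
Since Zoom cannot do this, one possible workaround is a small script outside Zoom. The sketch below is not a Zoom feature and is only a rough starting point: it assumes the links are stored as URI actions of the form `/URI (http://...)`, which is how PDF link annotations usually record a web address, and it scans the raw file bytes plus any Flate-compressed streams it manages to inflate. The names `find_link_uris` and `scan_tree` are made up for this example. Encrypted PDFs, hex-encoded strings, and streams using other filters would be missed, so a proper PDF library may be the safer choice for a real migration.

```python
import re
import zlib
from pathlib import Path

# A URI action typically looks like: /A << /S /URI /URI (http://example.com/) >>
URI_RE = re.compile(rb"/URI\s*\(((?:\\.|[^\\()])*)\)", re.DOTALL)
STREAM_RE = re.compile(rb"stream\r?\n(.*?)endstream", re.DOTALL)


def find_link_uris(data: bytes) -> list[str]:
    """Return the URI action targets found in raw PDF bytes."""
    chunks = [data]
    # Newer PDFs may pack annotation objects into Flate-compressed object
    # streams, so try to inflate every stream and skip the ones that fail.
    for m in STREAM_RE.finditer(data):
        try:
            chunks.append(zlib.decompressobj().decompress(m.group(1)))
        except zlib.error:
            continue
    uris = []
    for chunk in chunks:
        for m in URI_RE.finditer(chunk):
            # Undo backslash escapes such as \( and \) inside the PDF string.
            raw = re.sub(rb"\\(.)", rb"\1", m.group(1), flags=re.DOTALL)
            uris.append(raw.decode("latin-1"))
    return uris


def scan_tree(root: str) -> dict[str, list[str]]:
    """Map each PDF under root (a hypothetical folder) to its link URIs."""
    return {
        str(path): find_link_uris(path.read_bytes())
        for path in sorted(Path(root).rglob("*.pdf"))
    }
```

Calling `scan_tree` on the folder that holds the PDFs would give a per-file list of link targets, which could then be filtered for the old CMS host name.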
