PassMark Logo
Home » Forum

Announcement

Collapse
No announcement yet.

Following links to an external site

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Following links to an external site

    Hi Guys,

    My customer's site is on v7 and it was working like a charm...
    Until they decided to move their PDFs to an external site, ipaper
    The PDF files have been converted to flipbooks

    Here is an example URL:
    https://viewer.ipaper.io/domtar/ariv...-eng/?page=1#/
    https://viewer.ipaper.io/domtar/ariv...-list/?page=12


    As can be seen, the URLs to these flipbooks don't have an extension, so I ticked Zoom's 'Scan file with no extensions' in the Scan options

    I have read a few topics about following to external sites and tried to setup like the following topic:
    https://www.zoomsearchengine.com/for...te-1-link-deep
    but without any success:
    Could not download file: http://ariva.ca/fr/;https://viewer.i.../domtar/ariva/ (400 HTTP Error)

    I aslo tried setting up a second starting point, but am getting similar results:
    Could not download file: https://viewer.ipaper.io/domtar/ariva/ (File not found)

    While it is possible to setup a branded domain on iPaper, my customer would have to redo all the linking to his flipbooks.
    This would be a backup plan

    How can I correctly setup the Zoom indexer to follow the links to the ipaper site ?

    Thanks,
    Erik

  • #2
    Originally posted by Erik View Post
    I have read a few topics about following to external sites and tried to setup like the following topic:
    https://www.zoomsearchengine.com/for...te-1-link-deep
    but without any success:
    Could not download file: http://ariva.ca/fr/;https://viewer.i.../domtar/ariva/ (400 HTTP Error)
    You have entered the suggested Base URL as your Start Spider URL so this won't work.

    The Base URL is modified by clicking on the "More" button next to Start Spider URL (which should be simply "http://ariva.ca/fr/") and then "Edit"
    And then you modify the Base URL field to:
    Code:
    http://ariva.ca/fr;https://viewer.ipaper.io/domtar/ariva/
    Originally posted by Erik View Post
    I aslo tried setting up a second starting point, but am getting similar results:
    Could not download file: https://viewer.ipaper.io/domtar/ariva/ (File not found)
    This is not a valid URL (if you go to the URL in your browser you will get a 404 error). So this approach will not work.

    Let us know how you go with the above.
    --Ray
    Wrensoft Web Software
    Sydney, Australia
    Zoom Search Engine

    Comment


    • #3
      Thanks Ray, it works perfectly well now, I can see ipaper results in my search

      Comment

      Working...
      X