> ## Documentation Index
> Fetch the complete documentation index at: https://docs.toolip.io/llms.txt
> Use this file to discover all available pages before exploring further.

# How to Set Up Toolip With Norconex

> Flexible, region-specific, reliable web crawling? Norconex with Toolip is the answer

## What is Norconex?

**Norconex** is a Java-based, open-source web crawler focused on flexibility and customization. It supports various file types, integrates with third-party tools, and uses XML for configuration. Key features include data collection, parsing, and committing, OCR, JavaScript crawling, and robots.txt compliance—ideal for indexing, content gathering, and website optimization. Together with **Toolip** you will bypass geo-restrictions, avoid IP bans, and ensure more reliable, region-specific data collection during large-scale web crawling.

Prefer an enterprise-grade guide? Learn more about Oculus Proxies <a href="https://docs.oculusproxies.com/integration-guides/norconex" target="_blank" rel="noopener">Norconex Integration Guide</a>.

<Tip>
  Targeting search engines like Google, Bing, or Yandex requires a specialized proxy to ensure stable access and avoid blocks. **Toolip’s Search Engine ISP Proxies** are designed specifically for this, providing reliable performance where standard proxies may fail. If your proxy test isn’t working on search engines, switching to **Search Engine ISP Proxies** can resolve the issue.
</Tip>

## How to Set Up Toolip With Norconex

<Steps>
  <Step title="Install Java">
    Download and install the appropriate version of [**Java(JDK)**](https://www.oracle.com/java/technologies/javase/jdk17-archive-downloads.html) for your operating system.

    <Frame as="div">
      <img src="https://mintcdn.com/toolip-d3ca8ce9/WYOxfv19t56gmnRb/integration-guides/img/norconex1.png?fit=max&auto=format&n=WYOxfv19t56gmnRb&q=85&s=985a422d608261c63c295e68fa8012a9" alt="" width="1161" height="824" data-path="integration-guides/img/norconex1.png" />
    </Frame>
  </Step>

  <Step title="Environment Settings">
    Search for **Environment Variables** in the Windows search bar and select **Edit the system environment variables**. Under **User variables for \[USERNAME]**.
    Click **New**.

    <Frame as="div">
      <img src="https://mintcdn.com/toolip-d3ca8ce9/WYOxfv19t56gmnRb/integration-guides/img/norconex2.png?fit=max&auto=format&n=WYOxfv19t56gmnRb&q=85&s=48f342bb0dec58040a67fcc5869a030b" alt="" width="627" height="592" data-path="integration-guides/img/norconex2.png" />
    </Frame>
  </Step>

  <Step title="Set JAVA_HOME">
    In the **Variable name** field, enter `JAVA_HOME`.
    In the **Variable value** field, paste the path to your `JDK installation directory`.
    Then click **OK**.

    <Frame as="div">
      <img src="https://mintcdn.com/toolip-d3ca8ce9/WYOxfv19t56gmnRb/integration-guides/img/norconex3.png?fit=max&auto=format&n=WYOxfv19t56gmnRb&q=85&s=e85729e54a872db61403346fd3359425" alt="" width="659" height="170" data-path="integration-guides/img/norconex3.png" />
    </Frame>
  </Step>

  <Step title="Install Norconex">
    Download and install the **latest version of [Norconex](https://opensource.norconex.com/crawlers/web/)** from the official website.

    <Frame as="div">
      <img src="https://mintcdn.com/toolip-d3ca8ce9/WYOxfv19t56gmnRb/integration-guides/img/norconex4.png?fit=max&auto=format&n=WYOxfv19t56gmnRb&q=85&s=c3cd560e6e3c1c52faf6f21e2d7198de" alt="" width="1538" height="723" data-path="integration-guides/img/norconex4.png" />
    </Frame>
  </Step>

  <Step title="Extract and Configure">
    Create a folder for Norconex (e.g., C:\Norconex).
    Right-click the downloaded .zip file and extract its contents into this folder.

    Then, navigate to:
    `Norconex → Examples → collector-http-config-reference.xml`
    Right-click the file and open it in a code editor (e.g., Notepad).

    Inside the file, locate the `<httpFetchers>` and `</httpFetchers>` tags.
    Insert the required configuration code between these tags.

    ```bash theme={null}
    <httpFetcherFactory class="com.norconex.collector.http.fetch.impl.GenericHttpFetcherFactory">
    <proxySettings>
    <host>proxy.toolip.io</host>
    <port>12321</port> <-Replace with your port
    <scheme>http</scheme> <-or “https” if you bought TLS exits
    <credentials>
    <username>your-username</username> <-Replace with your username
    <password>your-password</password> <-Replace with your password
    </credentials>
    <urlFilter>.sample\.co\.jp.</urlFilter> <-Apply proxy only for matching URLs
    </proxySettings>
    </httpFetcherFactory>
    ```
  </Step>

  <Step title="Verify Configuration">
    Ensure the XML structure reflects your changes correctly.
    It should look like this:

    <Frame as="div">
      <img src="https://mintcdn.com/toolip-d3ca8ce9/WYOxfv19t56gmnRb/integration-guides/img/norconex5.png?fit=max&auto=format&n=WYOxfv19t56gmnRb&q=85&s=3c3303a13c8ee35d7de1a8d3d3f454ce" alt="" width="943" height="327" data-path="integration-guides/img/norconex5.png" />
    </Frame>
  </Step>
</Steps>
