Who can use this feature?
- Organization administrators, workspace administrators, and workspace users.
- Available for Accelerate and Enterprise.
To scan with Desktop Crawler App, make sure you've downloaded and installed the application first.
The Desktop Crawler App can scan up to 100 pages with a crawl depth of 25.
On this page:
- Generate an API key
- Add specific URLs to allowlist
- Log in to the Desktop Crawler App
- Configure and run the scan with Desktop Crawler App
- Understand configuration settings for crawling
- iframes and Desktop Crawler App
- Troubleshoot Desktop Crawler App
Generate an API key
Each time you log in to the Desktop Crawler App, you need an API key.
To generate an API key:
- Navigate to organizational level.
- Under Manage, select Tools & Integrations, and then API.
- Select Add API token.
- Add required details, and then select Generate API key.
- Save the API key to a secure file or password manager to be able to access it in the future. If you do not have access to your API key, you must generate a new one each time you use the app.
You will receive notifications prior to your API key expiration. For details about API keys, refer to Create an API token to use Level Access API.
Add specific URLs to allowlist
DCA will access some resources on the public internet for normal function. If your network security blocks access to these resources, you will need to work with your security team to add them to an allowlist.
Add the following URLs to the allowlist:
- api2.amplitude.com: Used to track user interaction via Amplitude.
- amp.levelaccess.net: DCA pulls best practices from Accessibility Management Platform (AMP) to add details to the results data.
- Instance of Level Access Platform that DCA is configured to push results to: You must be able to reach your instances of the platform. DCA can't push results to an instance it can’t reach. The initial setup where you choose a workspace and digital assets, won’t happen if DCA can’t reach the configured instance.
Log in to the Desktop Crawler App
Before you log in and start scanning, download Desktop Crawler App for the platform, obtain your organization URL and API key, and update the allowlist.
To log in to Desktop Crawler App:
- Open the app.
- In the Organization URL * field, enter the URL of the organization. For example, https://ACME.hub.essentia11y.com/. If you need to find the organization URL, refer to View organization details.
- In the API Key* field, enter the API key you generated.
- Select Log in.
You are now logged in and ready to start scanning with Desktop Crawler App. To proceed, you must select digital assets you want to scan, and configure the scan settings.
Note: The workspace contains the digital asset you want to use to store your scan results. If you have access to more than one workspace, the first one will be chosen by default. You can select another one from the dropdown menu.
Configure and run the scan with Desktop Crawler App
To scan with the Desktop Crawler App:
- On the Digital Assets screen, from the Workspace dropdown, select the workspace with the digital asset you want to scan. A list of assets you can access in this organization is displayed.
- Find the digital asset, for example website, you want to scan, and then select Create scan.
- Configure the settings for the scan. Enter or select the appropriate values:
- Scan title.
- Website URL.
- Scan tag.
- Maximum number of pages.
- Crawl depth.
- Skip URL:
- # endings
- ? endings
- Add webpages to exclude
- Select Authenticate to validate scan setup form entries if you are scanning a protected site, behind a security wall. Skip this step if you are scanning a public site.
When you select the Authenticate button, DCA launches a Chrome browser window and loads the URL specified in DCA as the starting URL. It asks you to log in if the URL is protected. If needed, complete the login. - Select Run scan.
The application starts crawling and scanning the pages using specified settings. It discovers, scans, and submits results in a sequential manner. Do not close the application. This will interrupt the scanning process, resulting in only partial results being available on the platform.
Scanning will stop when:
- All discovered possible URLs are scanned, or
- The maximum number of pages is reached.
Once the scan is complete, you can view the results in the platform or run a new scan.
Understand configuration settings for crawling
Before you can run a scan, you need to specify values for the following settings:
| Setting | Value |
| Scan title |
Name of the scan report that will be created. Includes an appended time stamp. Example Scan title: Edge Not Scan |
| Website URL |
The page on which the spider will start. Start location. Example: www.mycompany.com/foo
|
| Scan tag | Scan tags help you to categorize, filter, and find past scans. Refer to Scan tag best practices. |
| Maximum number of pages |
The number of pages to scan or test. The maximum value is 100 pages. Example: 50 |
| Crawl depth |
The depth of a webpage in the website’s hierarchical structure. Indicates how many sub-levels you'd like to test. Depth aligns with the number of backslashes in a URL. The minimum is 1. The maximum is 25. Example of Crawl depth value: 5
|
| Skip URL |
Both Skip options default to off, which is the same as the default for Level Access Platform scans and monitoring.
|
| Add webpages to exclude | Ignores URLs we want to skip when running a scan. Select the checkbox, and add comma separated URLs you want DCA to ignore when running a scan. For example, if you want to ignore the Logout link/page, add its URL to the list of pages to ignore. You can also specify these URLs as regular expressions. |
iframes and Desktop Crawler App
Many websites use iframes to embed another page inside a page, for example, to embed a social media widget. Desktop Crawler App always includes findings from iframes in the results. Level Access Platform defaults to excluding them.
Note the following guidelines:
- Best practice is to always have iFrame content on.
- Issues from iframes are combined with the parent page results.
- In the platform, when an issue is flagged inside an iframe, the issue is also given special notation in the locator (||). That is, element paths from iframes use special notation to show where they came from.
Troubleshoot Desktop Crawler App
Why did the scan fail?
Scans can fail if the website is not reachable, it takes too long to respond, or if the computer loses network connectivity.
My scan did not reach crawl depth I specified?
The Desktop Crawler App can scan up to 100 pages with a crawl depth of 25. You must ensure that Crawl depth aligns with the number of backslashes in a URL. The minimum is 1. The maximum is 25.
Why do my findings and scores differ between the platform and DCA?
The DCA and platform page selection algorithms might choose different sets of pages to test. For example, if a site has 100 pages but only 10 are scanned, the chosen set of pages might vary between the platform and DCA. Consequently, the findings and scores derived from those findings might differ between the platform and DCA. However, they will be identical if you scan all the pages, because page selection no longer influences the outcome.
Comments
0 comments
Article is closed for comments.