Viewing Content Scans

Learn how to review and maintain your content scans.


Overview

The Content Scans page provides an overview of the scans added to the DryvIQ Platform. The Scan name column displays the name assigned to the scan when it was created and lists the scan types selected. The Connection column identifies the platform, connection, and account used to access the platform, while the Path column identifies the location chosen for the scan (the target directory). The Status column indicates when the scan last ran. 

Viewing Child Scans

Only primary scans display on the All Scans page. When adding a content scan, you can use a document set or make the location a mapped location. In both instances, all scanning takes place at the child level rather than at the primary scan level. The primary scan runs only to determine the child scans that need to be created. For mapped locations, a child scan is created for each folder that is one level under the path selected for the scan. For document sets, a child scan is created for each path saved in the document set.

Mapped locations are identified by a folder icon in front of the path and “Mapped location” under the path. Document sets will not have a connection listed and will have “Document set” as the path. You can click the New Window icon in the Path column for these scans to view the child scans. Refer to "Viewing Child Scans" for more information. 

Searching Scans

The Refine button allows you to search the scans based on specific criteria. The search works as an “AND” statement when multiple options are applied. This is a case-sensitive search, so the search text must match the content precisely. 

Search Field

The Search field in the menu filters the search options only; it does not perform a search on the text entered. 

 
Refine Option  Description
Path Search for a scan based on the exact path. This information is displayed in the Location column. This is an exact search, so you must enter the exact path to return results.
Storage platform Search for scans on a specific storage platform. You will select the platform from a list of available storage platforms.
Impersonation user Search for a scan based on the user being impersonated. Note that not all platforms support impersonation. This is an exact search, so the entry must match the user exactly.
Connection name Search for a scan based on the connection name, which is the first line of text displayed in the Connection column. This is an exact search, so the entry must match the connection name exactly.
Connection user Search for a scan based on the user account used to create the connection. This is the second line of text displayed in the Connection column. This is an exact search, so the entry must match the user exactly.
Scan status

Search for a scan based on the scan status. If you apply a scan filter and scans within that status are missing, you may need to run the scans again. This is a known issue with how the scan execution status is calculated in the DryvIQ backend. The scan status options are:

  • Canceled: A user manually triggered and then manually stopped the scan.
  • Completed: The scan has run to its scheduled stop policy. (Stop policy configuration is only available through the REST API for content scans.) If you run a scan in “Completed” status, the stop policy will be reset.
  • Failed: The scan encountered a failure that caused it to stop processing.
  • Idle: The scan is not actively running. Scans are idle if they have never been run, require manual execution, or are scheduled but not yet in progress. The status will display “no scan” for scans that have not run yet.  
  • Paused: The scan was triggered by a schedule but was stopped manually by a user.
  • Running: The scan is currently in progress.
  • Queued: A scan is triggered to run, but no runner is available, or the max concurrent job setting has been met.
  • Warning: The scan is complete but contains warnings or item failures.
Scan date Search for a scan based on the date the scan last ran. Select the mode (between, on, after, or before) and the date you want to use.
Has executions? View only scans that have been executed. Select True to view scans that had at least one scan executed. Select False to view scans that have not been executed yet.
Has child scans? View only scans that have child scans. These are scans for mapped locations and document sets. Select True to view just these scans. Select False to exclude these scans.
Is a Document set? View only scans for document sets. Select True to view just document set scans. Select False to exclude document set scans. 

Starting Scans Manually

If you did not set a schedule for the scan when you created it, you must start it manually when you want it to run. Even if you've assigned a schedule to a scan, you can start it manually whenever you need it to run outside the scheduled time. You have the option of starting individual scans or selecting multiple scans. 

Staring Individual Scans

Use the Scan button to start a scan manually. The scan information identifies the percentage of the scan that has been completed and how many files have been scanned, allowing you to monitor the progress. Once a scan is running, the button will become a Pause button. You can use this button to stop the scan as needed. 

Click Scan to start a scan.
Monitor the scan progress. 

 

Starting Multiple Scans

You can also start multiple scans at once if needed. If scans are set to run on a schedule, manually starting them will override the schedule and start the scan. When the scan is complete, it will continue to run as scheduled.  

  1. Select the box that displays when you hover on the row.

  2. As you select scans to start, the label for the Start scans button will change to reflect the number of selected scans. Once you finish making your selections, click the Start # scans button.

  3. You are prompted to confirm you want to start the selected number of scans. Click OK to continue (or Cancel to cancel the action).

  4. DryvIQ begins starting the selected scans and displays the progress. Do not refresh the page or close the browser while the scans are starting, as this will interrupt the process and cause failures.

  5. You will see a message indicating all the scans have started. Click Close to close the modal. 

Understanding Scan Progress

The Status column displays when the last scan ran for each location and when it is scheduled to run next. If a scan is running, this column will display a progress bar to indicate the progress. If scans are running for multiple locations, each location displays a progress bar that updates every minute until the scan is complete.

Some platforms report an accurate content estimate that DryvIQ can use to calculate the scan progress. For these platforms, the scan progress will reflect the actual percentage of the scan progression. These scan percentages are approximations and may fluctuate based on the information provided by the platform (where applicable) or the amount of content being scanned. For example, if DryvIQ determines an estimate is wrong, it will recalculate the progress, which may decrease the progress from its previous level. Progress will then begin to increase again as the scan continues. 

SharePoint Online Estimates

While SharePoint Online does provide a count estimate that DryvIQ leverages to determine the scan progress, this estimate is not accurate if there are subdirectories in the source path. In this instance, the scan progress may recalculate. 

 

For platforms that don’t provide content estimates, DryvIQ will display the following percentages based on the scan phase currently being executed:  

  • 20%: The scan is initializing and preparing to run.
  • 30%: The scan is determining changes to the content.
  • 40%: The scan is executing. This is when it is actively reading content and compiling the statistics for the reporting. During this phase, DryvIQ will update the number of files scanned, if any.
  • 80%: The scan is finalizing.
  • 100%: The scan is complete. The Scan status will display 100% for one minute after the scan completes and before the last scan date displays.

For failed scans, you can click the View Activity log link to go to the Activity page to review the logged activity for the selected scan. If the scan is part of a mapped location or data set, the page displays activity for all child scans included in the mapped location or data set. A pill displays the scan ID for the selected scan or parent scan for your reference. You can use the Reset button to clear the selected scan and view all scan activity again as needed. The activity may help you understand what caused the scan to fail, so you can correct any issues that can be addressed before attempting to rerun the scan. 

Viewing Content

You can quickly view the content for any location. Click the ellipses (…) at the end of the row and click View Content from the menu that displays. You will be taken to the Content page, filtered to display just the contents for the selected location. For mapped locations, the content will include all child locations. 

Viewing Insights

You can quickly view the insights for any location. Note that the insights page displayed depends on the last insights you viewed. Therefore, go to the insights, select the page, and then come here to select the location. Click the ellipses (…) at the end of the row and click View Insights from the menu that displays. You will be taken to the last insights page you viewed, and the charts will be filtered to display the insights for the selected location. This option is not available for the primary scan for mapped locations since there is no information to view for these scans. Insights can be viewed through the child scans.

Duplicating a Scan

Duplicating a scan allows you to create a new scan based on the settings for an existing scan. Click the ellipses (…) at the end of the row and click Duplicate Scan from the menu that displays. The Scan Setup wizard will open and automatically populate with the settings from the duplicated scan. You must select the data source you want to use for the scan and adjust any settings if needed. 

Schedules for Document Sets and Mapped Locations

When duplicating the primary scan for a document set or mapped location, the schedule defaults to scanning every six hours. Custom schedules are applied only to the child scans. If you prefer the custom schedule used for the child scans for a document set or mapped location, you should duplicate one of the child scans instead of the primary scan so that you do not need to edit the schedule.

 

Renaming a Scan

You can edit the scan name assigned to a scan at any time. Click the ellipses (…) at the end of the row and click Rename Scan from the menu that displays. This will open the Rename Scan modal. Edit the scan name and click OK to save the changes.

Editing Scan Types

You can update the scan types assigned to the scan. This allows you to add, remove, or adjust the scan types. Click the ellipses (…) at the end of the row and click Edit Scan Types from the menu that displays. This will open the Edit Scan Types modal. The options on this modal are the same options available when creating a content scan. Select the scan types you want to add and clear the ones you want to remove. Once the edits are saved, the new scan types will be applied to the next scan. Scans will behave as follows based on the selected data source type.

  • Location Scan: The scan only looks for new files. Existing content will not be rescanned or updated. Only newly identified items will be scanned using the updated scan types.
  • Mapped Location Scan:  The scan will continue to create child scans as needed. Existing items for child scans will not be rescanned or updated. Only newly identified content will be scanned using the updated scan types.
  • Document Set Scan: The scan will continue to create child scans as needed. Existing items for child scans will not be rescanned or updated. Only newly identified items will be scanned using the updated scan types. 

Resetting A Scan

You can reset a scan to reset its tracking, allowing files to be rescanned as part of the next scan run. Resets affect the tracking data only; it does not remove content from the data catalog. Click the ellipses (…) at the end of the row and click Reset scan from the menu that displays. Reset scans will behave as follows based on the selected data source type.

  • Location Scan: The reset forces a rescan of the entire location and republishes all entities to the data catalog.
  • Mapped Location Scan: The reset will rescan the primary location to create child scans as needed. The child scans will be reset and rescanned so the new entities can be published for all items.
  • Document Set: The reset will rescan to create child scans as needed. 

Managing Scan Schedules

You can update a scan schedule or add a schedule to a scan at any time. Click the ellipses (…) at the end of the row and click Manage Schedule from the menu that displays. This will open the Manage Schedule modal. The scheduling options here are the same as those available when adding a schedule during content scan creation. Refer to “Adding a Content Scan” for more information. 

Primary Scan Schedules

The primary scan for mapped locations has a default schedule that runs every six hours. The schedule defined when the scan was created is assigned to the child scans and can be edited on the Child Scans page. 

 

Removing a Scan

You can remove a scan if you no longer want the corresponding location or document set to be evaluated. Click the ellipses (…) at the end of the row and click Remove Scan from the menu that displays. Click "Yes" when prompted to confirm that you want to remove the scan. 

Sorting the Content Scans Grid

By default, the scans are listed in the order they were created. You can sort the locations based on any column in the grid. Clicking a column header will sort that column in ascending order. An up arrow will appear on the column to indicate the ascending sort order. Clicking the column header again will sort the column in descending order. A down arrow will appear on the column to indicate the descending sort order. Clicking the column header a third time will use the default sort order. The selected sort will persist until you change the sort order or choose to sort a different column.

Natural Sorting

DryvIQ uses natural sorting. Learn more about how natural sorting works here