Status Message | Description |
---|---|
INITIATED | Once Verity has checked that the URL is properly formed and does not already exist in the database, the request is passed to the Verity classification systems and the status updates to INITIATED. |
PROCESSING | The Verity classification system is processing the text and images on the specified page. |
PROCESSED | The URL has been processed and the Verity analysis JSON is available. The analysis results have been stored. |
ERROR | Page processing has been attempted and failed. The page URL is recorded in the Error Cache for 1 hour. If another request to process the same URL is received within 1 hour, Verity will return an Error status (unless the ignoreCache flag is enabled). After 1 hour, the ERROR status is cleared and Verity will process a new request for the URL. Several different conditions may result in an ERROR status message:
|
NOT_SUPPORTED | The language of the page is not supported (see Language Support Grid ). This status message may also be returned if Verity is unable to process the requested website. |
INSUFFICIENT_CONTENT | Verity’s content extraction processes cannot extract sufficient relevant content from a page to adequately perform classification tasks across text and imagery. |
INVALID | The HTTP URL request may be malformed, for example:
|
PAGE_CONTENT_EXTRACTION_FAILED_WITH_403_FORBIDDEN | A website has blocked our web crawler from downloading content. |
PAGE_CONTENT_EXTRACTION_FAILED_WITH_404_NOT_FOUND | Verity’s web crawler was not able to locate any content for the provided url. |
PAGE_CONTENT_EXTRACTION_FAILED_WITH_500_INTERNAL_SERVER_ERROR | There was an unknown issue received from the webpage when an attempt to crawl was made. Note: The PCE attempts up to three times to extract web content. |
PAGE_CONTENT_EXTRACTION_FAILED_WITH_4XX | A generic 4XX response was received during a crawl attempt by PCE. |
PAGE_CONTENT_EXTRACTION_FAILED_WITH_5XX | A generic 5XX response was received during a crawl attempt by PCE. |
Status messages returned by the page API.
Manage space
Manage content
Integrations