Information extraction engine service

Service Documentation

Service Parameters

pageurl
    Description: A URL for the resource that the information extractor should retrieve and then process. This parameter is optional; however, if pageurl is not specified, doctext should be specified. If doctext has non-zero length, pageurl is ignored. If the information extractor cannot retrieve the resource located at pageurl, it returns an HTTP 403 error to the calling client.
    Data Type: RFC 1738-compliant Uniform Resource Locator
    Default Value: Null

doctext
    Description: The raw text content to be processed by the information extractor. The content should be plain text, HTML, or XML.
    Data Type: String
    Default Value: Null

depth
    Description: The concept depth, or density, that the information extractor should aim for when mining concepts. The higher the value, the more concepts are extracted.
    Data Type: Integer
    Default Value: 100

singlewordconcepts
    Description: When this parameter is present with a non-zero-length value, the information extractor returns single-word concepts in addition to multi-word concepts. When this parameter is absent or zero-length, the information extractor mines multi-word concepts only.
    Data Type: Variant (any non-zero-length value)
    Default Value: Null
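
The parameter rules above can be sketched as a small Python helper that assembles a request. This is only an illustration: the documentation does not give the service address or the HTTP method, so SERVICE_URL and the helper name build_params are hypothetical, and the value "1" for singlewordconcepts is an arbitrary non-zero-length choice. The sketch builds and encodes the parameters but does not send them.

```python
from urllib.parse import urlencode

# Hypothetical endpoint: the documentation does not state the service address.
SERVICE_URL = "http://example.com/extract"


def build_params(pageurl=None, doctext=None, depth=100, singlewordconcepts=False):
    """Assemble request parameters following the documented rules."""
    if not doctext and not pageurl:
        # The documentation says doctext should be given when pageurl is not.
        raise ValueError("either pageurl or doctext should be specified")

    params = {}
    if doctext:
        # A non-zero-length doctext causes pageurl to be ignored, so omit it.
        params["doctext"] = doctext
    else:
        params["pageurl"] = pageurl

    # depth is an integer; 100 is the documented default.
    params["depth"] = int(depth)

    if singlewordconcepts:
        # Any non-zero-length value enables single-word concepts.
        params["singlewordconcepts"] = "1"
    return params


params = build_params(pageurl="http://example.com/article.html", depth=50)
query = urlencode(params)  # percent-encodes the URL value for safe transport
print(SERVICE_URL + "?" + query)
```

Leaving singlewordconcepts out of the request entirely, rather than sending an empty value, mirrors the documented "not present or zero-length" case. A real client would also need to handle the HTTP 403 response that the service returns when it cannot retrieve pageurl.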