Class Reference
%iKnow.Queries.SourceAPI
Server:basexml
Instance:SOAXML
User:UnknownUser
 
-
  [BASEXML] >  [%iKnow] >  [Queries] >  [SourceAPI]
Private  Storage

class %iKnow.Queries.SourceAPI extends %iKnow.Queries.AbstractAPI

Main Query API for retrieving sources and related information.

Inventory

Parameters Properties Methods Queries Indices ForeignKeys Triggers
12 34


Summary

Methods
%%OIDGet %AddToSaveSet %ClassIsLatestVersion %ClassName
%ConstructClone %DispatchClassMethod %DispatchGetModified %DispatchGetProperty
%DispatchMethod %DispatchSetModified %DispatchSetMultidimProperty %DispatchSetProperty
%Extends %GetParameter %IsA %IsModified
%New %NormalizeObject %ObjectModified %OriginalNamespace
%PackageName %RemoveFromSaveSet %SerializeObject %SetModified
%ValidateObject GetByCrcIds GetByCrcMask GetByCrcs
GetByDomain GetByEntities GetByEntityIds GetByEquivalentIds
GetByEquivalents GetByPathIds GetCountByCrcIds GetCountByCrcMask
GetCountByCrcs GetCountByDomain GetCountByEntities GetCountByEntityIds
GetCountByPathIds GetExternalId GetFullReference GetLanguages
GetSimilar GetSourceId GetSummary GetSummaryDirect
GetTopLanguage


Parameters

• parameter GetByCrcIdsRT = "srcId:%Integer,externalId:%String";
• parameter GetByCrcMaskRT = "srcId:%Integer,externalId:%String";
• parameter GetByCrcsRT = "srcId:%Integer,externalId:%String";
• parameter GetByDomainRT = "srcId:%Integer,externalId:%String";
• parameter GetByEntitiesRT = "srcId:%Integer,externalId:%String";
• parameter GetByEntityIdsRT = "srcId:%Integer,externalId:%String";
• parameter GetByEquivalentIdsRT = "srcId:%Integer,externalId:%String";
• parameter GetByEquivalentsRT = "srcId:%Integer,externalId:%String";
• parameter GetByPathIdsRT = "srcId:%Integer,externalId:%String";
• parameter GetLanguagesRT = "language:%String,totalConfidence:%Numeric,sentenceCount:%Integer";
• parameter GetSimilarRT = "srcId:%Integer,extId:%String,percentageMatched:%Numeric,percentageNew:%Numeric,nbOfEntsInRefSrc:%Integer,nbOfEntsInCommon:%Integer,nbOfEntsInSimSrc:%Integer,score:%Numeric";
• parameter GetSummaryRT = "sentId:%Integer,sentenceValue:%String,sentenceIsTruncated:%Boolean";

Methods

• classmethod GetByCrcIds(ByRef result, domainid As %Integer, crcidlist As %List, page As %Integer = 1, pagesize As %Integer = 10, filter As %iKnow.Filters.Filter = "", setop As %Integer = $$$UNION) as %Status

Retrieves all sources containing the given CRC ids.

See also GetByEntities for a description of the parameters.

• classmethod GetByCrcMask(ByRef result, domainid As %Integer, master As %String = $$$WILDCARD, relation As %String = $$$WILDCARD, slave As %String = $$$WILDCARD, page As %Integer = 1, pagesize As %Integer = 10, filter As %iKnow.Filters.Filter = "") as %Status

Retrieves all sources containing a CRC satisfying the given CRC Mask.

See also GetByEntities for a description of the parameters.

• classmethod GetByCrcs(ByRef result, domainid As %Integer, crclist As %List, page As %Integer = 1, pagesize As %Integer = 10, filter As %iKnow.Filters.Filter = "", setop As %Integer = $$$UNION) as %Status

Retrieves all sources containing the given CRCs.

See also GetByEntities for a description of the parameters.

• classmethod GetByDomain(ByRef result, domainid As %Integer, page As %Integer = 1, pagesize As %Integer = 10, filter As %iKnow.Filters.Filter = "") as %Status

Returns a list of all sources within a domain, optionally filtered through a %iKnow.Filters.Filter object passed in as filter.

• classmethod GetByEntities(ByRef result, domainid As %Integer, entitylist As %List, page As %Integer = 1, pagesize As %Integer = 10, filter As %iKnow.Filters.Filter = "", setop As %Integer = $$$UNION) as %Status

This method will retrieve all sources containing any (if = $$$UNION) or all (if = $$$INTERSECT) of the entities supplied through entitylist.

The scope of this query can be limited through supplying a %iKnow.Filters.Filter object for the filter parameter to restrict the result to those sources satisfying the filter criteria.

Results are ordered by decreasing source id.

• classmethod GetByEntityIds(ByRef result, domainid As %Integer, entityidlist As %List, page As %Integer = 1, pagesize As %Integer = 10, filter As %iKnow.Filters.Filter = "", setop As %Integer = $$$UNION) as %Status

Retrieves all sources containing the given entity ids.

See also GetByEntities for a description of the parameters.

• classmethod GetByEquivalentIds(ByRef result, domainid As %Integer, entityidlist As %List, setid As %Integer, page As %Integer = 1, pagesize As %Integer = 10, filter As %iKnow.Filters.Filter = "") as %Status
• classmethod GetByEquivalents(ByRef result, domainid As %Integer, entitylist As %List, setid As %Integer, page As %Integer = 1, pagesize As %Integer = 10, filter As %iKnow.Filters.Filter = "") as %Status
• classmethod GetByPathIds(ByRef result, domainid As %Integer, pathidlist As %List, page As %Integer = 1, pagesize As %Integer = 10, filter As %iKnow.Filters.Filter = "") as %Status

Retrieves all sources containing the given path ids.

See also GetByEntities for a description of the parameters.

• classmethod GetCountByCrcIds(domainid As %Integer, crcidlist As %List, filter As %iKnow.Filters.Filter = "", setop As %Integer = $$$UNION, Output sc As %Status = $$$OK) as %Integer

Retrieves the number of sources containing the given CRC ids.

See also GetByEntities for a description of the parameters.

• classmethod GetCountByCrcMask(domainid As %Integer, master As %String = $$$WILDCARD, relation As %String = $$$WILDCARD, slave As %String = $$$WILDCARD, filter As %iKnow.Filters.Filter = "", Output sc As %Status = $$$OK) as %Integer

Retrieves the number of sources containing a CRC satisfying the given CRC Mask.

See also GetByEntities for a description of the parameters.

• classmethod GetCountByCrcs(domainid As %Integer, crclist As %List, filter As %iKnow.Filters.Filter = "", setop As %Integer = $$$UNION, Output sc As %Status = $$$OK) as %Integer

Retrieves the number of sources containing the given CRCs.

See also GetByEntities for a description of the parameters.

• classmethod GetCountByDomain(domainid As %Integer, filter As %iKnow.Filters.Filter = "", Output sc As %Status = $$$OK) as %Integer

This method returns the total number of sources in the specified domain, optionally filtered through a %iKnow.Filters.Filter object passed in as filter.

• classmethod GetCountByEntities(domainid As %Integer, entitylist As %List, filter As %iKnow.Filters.Filter = "", setop As %Integer = $$$UNION, Output sc As %Status = $$$OK) as %Integer

Retrieves the number of sources containing the given entities.

See also GetByEntities for a description of the parameters.

• classmethod GetCountByEntityIds(domainid As %Integer, entityidlist As %List, filter As %iKnow.Filters.Filter = "", setop As %Integer = $$$UNION, Output sc As %Status = $$$OK) as %Integer

Retrieves the number of sources containing the given entity ids.

See also GetByEntities for a description of the parameters.

• classmethod GetCountByPathIds(domainid As %Integer, pathidlist As %List, filter As %iKnow.Filters.Filter = "", Output sc As %Status = $$$OK) as %Integer

Retrieves the number of sources containing the given path ids.

See also GetByEntities for a description of the parameters.

• classmethod GetExternalId(domainid As %Integer, srcId As %Integer, Output sc As %Status = $$$OK) as %String

Retrieves the external ID for the source identified by srcId.

A negative srcId is considered to refer to a Virtual Source.

• classmethod GetFullReference(domainId As %Integer, srcId As %Integer, Output sc As %Status = $$$OK) as %String

Retrieves the Full Reference for the source identified by srcId.

A negative srcId is considered to refer to a Virtual Source.

• classmethod GetLanguages(ByRef result, domainid As %Integer, sourceid As %Integer) as %Status

This method returns, for a given source, the list of languages the iKnow Automatic Language Identification algorithm has identified for the sentences it contains. For each language, it will list the sum of ALI confidence ratings when assigning the language to a sentence, plus the total number of sentences the language was assigned to. If ALI was disabled when indexing this source, confidence ratings will be set to 0.

A negative Source ID is interpreted as a Virtual Source.

• classmethod GetSimilar(ByRef result, domainid As %Integer, sourceid As %Integer, page As %Integer = 1, pagesize As %Integer = 10, filter As %iKnow.Filters.Filter = "", algorithm As %String = $$$SIMSRCSIMPLE, algorithmParams As %List = "", blackListIds As %List = "") as %Status

Retrieves those sources that have similar content to the one specified through the sourceId parameter. Similarity is expressed by the (approximated) ratio between the length of the source and the overlap with the starting source, based on the chosen algorithm (according to the algorithm-specific algorithmParams).

Note: whenever possible, use a filter object to limit the search space for this query to perform adequately.

If the supplied Source ID is negative, it is treated as a Virtual Source. While this allows looking for sources similar to a Virtual Source, the query will not verify against other Virtual Sources and they will therefore not appear in the results of this query.

The blackListIds parameter allows a user to select one or more BlackLists of which the elements should be completely ignored in the similarity algorithm. Blacklisted elements will be ignored both for selecting similar candidates AND the calculation of their similarity score.

The following algorithms are currently available:

  • $$$SIMSRCSIMPLE: basic algorithm simply looking at entities (or CRCs or CCs) present in the reference source.
    Parameters:
    1. targetType As %String - either "ent", "crc" or "cc" (case insensitive, defaults to "ent"). Please note this implies the result columns will represent the corresponding targets, despite being named "nbOfEnt***"
  • $$$SIMSRCEQUIVS: similar to $$$SIMSRCSIMPLE, but also taking into account equivalent terms.
    Parameters:
    1. equivSetId As %Integer - equivalency set ID
  • $$$SIMSRCDOMENTS starts looking for similar sources based on dominant terms in the reference source.
    Parameters:
    1. strict As %Boolean - whether or not to select only sources for which at least one of the reference source's dominant concepts are also dominant in a similar source. (defaults to 1)
• classmethod GetSourceId(domainid As %Integer, externalId As %String, Output sc As %Status = $$$OK) as %Integer

Retrieves the source ID for the source identified by externalId.

• classmethod GetSummary(ByRef result, domainid As %Integer, sourceid As %Integer, length As %Integer = 5) as %Status

This method returns the most relevant sentences of the required document, in their original order. The number of sentences returned is controlled by the length parameter.

A negative Source ID is interpreted as a Virtual Source.

• classmethod GetSummaryDirect(domainId As %Integer, sourceId As %Integer, length As %Integer = 5, Output isTruncated As %Boolean = 0, Output sc As %Status = $$$OK, separator As %String = " ... ") as %String

This is a utility wrapper method around GetSummary, returning the summary sentences as a single concatenated string.

The isTruncated output parameter indicates whether or not the returned string is complete or was truncated to avoid MAXSTRING errors. This can happen when either an individual sentence surpassed the maximum string length or when the concatenation of these sentences surpassed it.

• classmethod GetTopLanguage(domainid As %Integer, sourceid As %Integer, Output totalConfidence As %Numeric = "", Output numberOfSentences As %Integer = "", Output sc As %Status = $$$OK) as %String

This method returns the language of a given source, calculated as the one with the highest total confidence rating for the individual sentences in the source. This total confidence, as well as the number of sentences to which this language was assigned is provided through output variables totalConfidence and numberOfSentences.

A totalConfidence of 0 indicates ALI was switched off when indexing this source.

A negative Source ID is interpreted as a Virtual Source.