Oracle® Database PL/SQL Packages and Types Reference 11g Release 2 (11.2) Part Number E10577-03 |
The DBMS_XDBT package provides a convenient mechanism for administrators to set up a CONTEXT index on the Oracle XML DB hierarchy. The package contains procedures to create default preferences, create the index, and set up automatic synchronization of the CONTEXT index.
The DBMS_XDBT package also contains a set of package variables that describe the configuration settings for the index. These are intended to cover the basic customizations that installations may require, but they are by no means a complete set.
See Also:
Oracle XML DB Developer's Guide
This chapter contains the following topics:
Overview
Operational Notes
The DBMS_XDBT package can be used in the following fashion:
Customize the package to set up the appropriate configuration.
Use the DROPPREFERENCES Procedure to drop any existing index preferences.
Create new index preferences using the CREATEPREFERENCES Procedure.
Create the CONTEXT index using the CREATEINDEX Procedure.
Set up automatic synchronization of the index using the CONFIGUREAUTOSYNC Procedure.
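The steps listed above, after the customization step, can be sketched as a single anonymous block. This is a minimal sketch, assuming the package is installed and can be called as DBMS_XDBT by the current user (depending on the installation, a schema prefix may be needed) and that any customization has already been applied.

```sql
BEGIN
  -- Drop any index preferences left over from an earlier run.
  DBMS_XDBT.DROPPREFERENCES;

  -- Create the default preferences from the current configuration settings.
  DBMS_XDBT.CREATEPREFERENCES;

  -- Create the CONTEXT index on the XML DB hierarchy.
  DBMS_XDBT.CREATEINDEX;

  -- Set up the jobs that keep the index synchronized.
  DBMS_XDBT.CONFIGUREAUTOSYNC;
END;
/
```

Running the calls in this order keeps stale preferences from an earlier attempt out of the new index.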
The DBMS_XDBT package can be customized by using a PL/SQL procedure or an anonymous block to set the relevant package variables (the configuration settings) and then execute the procedures. A more general approach is to introduce the appropriate customizations by modifying this package in place, or as a copy.
The system must be configured to use job queues; the jobs can be viewed through the USER_JOBS catalog views.
This section describes the configuration settings, or package variables, available to customize the DBMS_XDBT package.
Table 164-1 General Indexing Settings for Customizing DBMS_XDBT
Parameter | Default Value | Description |
---|---|---|
IndexName | | Name of the CONTEXT index |
IndexTablespace | | Tablespace used by tables and indexes comprising the CONTEXT index |
IndexMemory | | Memory used by index creation and SYNC; less than or equal to the |
LogFile | | The log file used for ROWID logging during index creation |
Table 164-2 Filtering Settings for Customizing DBMS_XDBT
Parameter | Default Value | Description |
---|---|---|
SkipFilter_Types | | List of mime types that should not be indexed. |
NullFilter_Types | | List of mime types that do not need to use the INSO filter. Use this for text-based documents. |
FilterPref | | Name of the filter preference. |
Table 164-3 Stoplist Settings for Customizing DBMS_XDBT
Parameter | Default Value | Description |
---|---|---|
StoplistPref | | Name of the stoplist. |
StopWords | | List of stopwords, in excess of CTXSYS.DEFAULT_STOPLIST |
Table 164-4 Sectioning and Section Group Settings for Customizing DBMS_XDBT
Parameter | Default Value | Description |
---|---|---|
SectionGroup | | Default sectioner. Use |
SectiongroupPref | | Name of the section group. |
Table 164-5 Other Index Preference Settings for Customizing DBMS_XDBT
Parameter | Default Value | Description |
---|---|---|
DatastorePref | | Name of the datastore preference. |
StoragePref | | Name of the storage preference. |
WordlistPref | | Name of the wordlist preference. |
LexerPref | | Name of the default lexer preference. |
Table 164-6 SYNC (CONTEXT Synchronization) Settings for Customizing DBMS_XDBT
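Table 164-7 DBMS_XDBT Package Subprograms
Subprogram | Description |
---|---|
CONFIGUREAUTOSYNC Procedure | Configures the CONTEXT index for automatic SYNCs |
CREATEDATASTOREPREF Procedure | Creates a USER datastore preference for the CONTEXT index |
CREATEFILTERPREF Procedure | Creates a filter preference for the CONTEXT index |
CREATEINDEX Procedure | Creates the CONTEXT index |
CREATELEXERPREF Procedure | Creates a lexer preference for the CONTEXT index |
CREATEPREFERENCES Procedure | Creates preferences required for the CONTEXT index |
CREATESTORAGEPREF Procedure | Creates a storage preference for the CONTEXT index |
CREATESECTIONGROUPPREF Procedure | Creates a section group for the CONTEXT index |
CREATEWORDLISTPREF Procedure | Creates a wordlist preference for the CONTEXT index |
CREATESTOPLISTPREF Procedure | Creates a stoplist for the CONTEXT index |
DROPPREFERENCES Procedure | Drops any existing preferences |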
CONFIGUREAUTOSYNC Procedure
This procedure sets up jobs for automatic SYNCs of the CONTEXT index.
Syntax
DBMS_XDBT.CONFIGUREAUTOSYNC;
Usage Notes
The system must be configured for job queues for automatic synchronization. The jobs can be viewed using the USER_JOBS catalog views.
The configuration parameter AutoSyncPolicy can be set to choose an appropriate synchronization policy.
The synchronization can be based on one of the following:
Sync Basis | Description |
---|---|
SYNC_BY_PENDING_COUNT | The SYNC is triggered when the number of documents in the pending queue is greater than a threshold (see the MaxPendingCount configuration setting). The pending queue is polled at regular intervals (see the CheckPendingCountInterval configuration parameter) to determine if the number of documents exceeds the threshold. |
SYNC_BY_TIME | The SYNC is triggered at regular intervals (see the SyncInterval configuration parameter). |
SYNC_BY_PENDING_COUNT_AND_TIME | A combination of both of the preceding options. |
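Choosing a policy and checking the resulting jobs might look like the following. This is a sketch under assumptions: it presumes that AutoSyncPolicy can be assigned from an anonymous block, as the customization notes suggest, and that SYNC_BY_TIME is exposed as a package-level constant of DBMS_XDBT. Neither the variable's type nor the constant's declaration is shown in this chapter.

```sql
BEGIN
  -- Assumption: AutoSyncPolicy is a writable package variable and
  -- SYNC_BY_TIME is a public constant of DBMS_XDBT.
  DBMS_XDBT.AutoSyncPolicy := DBMS_XDBT.SYNC_BY_TIME;

  -- Create the synchronization job(s) for the chosen policy.
  DBMS_XDBT.CONFIGUREAUTOSYNC;
END;
/

-- List the jobs owned by the current user to confirm that a SYNC job exists.
SELECT job, what, next_date, interval, broken
FROM   user_jobs
ORDER  BY job;
```

If the query returns no rows, it is worth checking that job queues are enabled on the system before investigating further.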
CREATEDATASTOREPREF Procedure
This procedure creates a user datastore preference for the CONTEXT index on the XML DB hierarchy.
Syntax
DBMS_XDBT.CREATEDATASTOREPREF;
Usage Notes
The name of the datastore preference can be modified; see the DatastorePref configuration setting.
The default USER datastore procedure also filters the incoming document. The DBMS_XDBT package provides a set of configuration settings that control the filtering process.
The SkipFilter_Types array contains a list of regular expressions. Documents with a mime type that matches one of these expressions are not indexed. Some of the properties of the document metadata, such as author, remain unindexed.
The NullFilter_Types array contains a list of regular expressions. Documents with a mime type that matches one of these expressions are not filtered; however, they are still indexed. This is intended to be used for documents that are text-based, such as HTML, XML, and plain text.
All other documents use the INSO filter through the IFILTER API.
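The three filtering rules above can be illustrated with a small standalone block. This is only a sketch of the decision logic, not the package's implementation: the pattern lists, the helper functions matches_any and filter_action, and the order in which the two lists are checked are all assumptions made for illustration, and the package's actual default patterns are not reproduced here.

```sql
SET SERVEROUTPUT ON

DECLARE
  TYPE pattern_list IS TABLE OF VARCHAR2(100);

  -- Hypothetical patterns standing in for SkipFilter_Types and NullFilter_Types.
  skip_types pattern_list := pattern_list('^image/', '^audio/', '^video/');
  null_types pattern_list := pattern_list('^text/', '^application/xml$');

  -- Returns TRUE when mime_type matches at least one pattern in the list.
  FUNCTION matches_any(mime_type IN VARCHAR2, patterns IN pattern_list)
    RETURN BOOLEAN IS
  BEGIN
    FOR i IN 1 .. patterns.COUNT LOOP
      IF REGEXP_LIKE(mime_type, patterns(i)) THEN
        RETURN TRUE;
      END IF;
    END LOOP;
    RETURN FALSE;
  END matches_any;

  -- Applies the rules in order: skip list, then null-filter list, then INSO.
  FUNCTION filter_action(mime_type IN VARCHAR2) RETURN VARCHAR2 IS
  BEGIN
    IF matches_any(mime_type, skip_types) THEN
      RETURN 'not indexed';
    ELSIF matches_any(mime_type, null_types) THEN
      RETURN 'indexed without filtering';
    ELSE
      RETURN 'indexed through the INSO filter';
    END IF;
  END filter_action;
BEGIN
  DBMS_OUTPUT.PUT_LINE('image/png       : ' || filter_action('image/png'));
  DBMS_OUTPUT.PUT_LINE('text/html       : ' || filter_action('text/html'));
  DBMS_OUTPUT.PUT_LINE('application/pdf : ' || filter_action('application/pdf'));
END;
/
```

The SET SERVEROUTPUT ON line is a SQL*Plus command; other clients may need a different way to display DBMS_OUTPUT text.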
CREATEFILTERPREF Procedure
This procedure creates a NULL filter preference for the CONTEXT index on the XML DB hierarchy.
Syntax
DBMS_XDBT.CREATEFILTERPREF;
Usage Notes
The name of the filter preference can be modified; see the FilterPref configuration setting.
The USER datastore procedure filters the incoming document; see CREATEDATASTOREPREF Procedure for more details.
CREATEINDEX Procedure
This procedure creates the CONTEXT index on the XML DB hierarchy.
Syntax
DBMS_XDBT.CREATEINDEX;
Usage Notes
The name of the index can be changed; see the IndexName configuration setting.
Set the LogFile configuration parameter to enable ROWID logging during index creation.
Set the IndexMemory configuration parameter to determine the amount of memory that index creation, and later SYNCs, will use.
CREATELEXERPREF Procedure
This procedure creates a BASIC lexer preference for the CONTEXT index on the XML DB hierarchy.
Syntax
DBMS_XDBT.CREATELEXERPREF;
Usage Notes
The name of the lexer preference can be modified; see the LexerPref configuration setting. No other configuration settings are provided.
MultiLexer preferences are not supported.
Base letter translation is turned on by default.
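For comparison, a hand-written lexer preference with the same characteristics could be created through the Oracle Text CTX_DDL package. This is a sketch of an equivalent preference, not the code the procedure runs; the preference name MY_XDB_LEXER is hypothetical.

```sql
BEGIN
  -- MY_XDB_LEXER is a hypothetical preference name used for illustration.
  CTX_DDL.CREATE_PREFERENCE('MY_XDB_LEXER', 'BASIC_LEXER');

  -- Turn on base-letter translation so that accented and unaccented
  -- forms of a word are indexed as the same token.
  CTX_DDL.SET_ATTRIBUTE('MY_XDB_LEXER', 'BASE_LETTER', 'YES');
END;
/
```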
CREATEPREFERENCES Procedure
This procedure creates a set of default preferences based on the configuration settings.
Syntax
DBMS_XDBT.CREATEPREFERENCES;
CREATESECTIONGROUPPREF Procedure
This procedure creates a section group for the CONTEXT index on the XML DB hierarchy.
Syntax
DBMS_XDBT.CREATESECTIONGROUPPREF;
Usage Notes
The name of the section group can be changed; see the SectiongroupPref configuration setting.
The HTML sectioner is used by default. No zone sections are created by default. If the vast majority of documents are XML, consider using the AUTO_SECTION_GROUP or the PATH_SECTION_GROUP; see the SectionGroup configuration setting.
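A section group of the kind described here can also be created by hand with CTX_DDL. This is a sketch for illustration only, not the procedure's own code; the section group name MY_XDB_SECTIONS is hypothetical.

```sql
BEGIN
  -- MY_XDB_SECTIONS is a hypothetical section group name.
  -- HTML_SECTION_GROUP corresponds to the HTML sectioner; for a repository
  -- that holds mostly XML, 'AUTO_SECTION_GROUP' or 'PATH_SECTION_GROUP'
  -- could be passed instead.
  CTX_DDL.CREATE_SECTION_GROUP('MY_XDB_SECTIONS', 'HTML_SECTION_GROUP');
END;
/
```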
CREATESTOPLISTPREF Procedure
This procedure creates a stoplist for the CONTEXT index on the XML DB hierarchy.
Syntax
DBMS_XDBT.CREATESTOPLISTPREF;
Usage Notes
The name of the stoplist can be modified; see the StoplistPref configuration setting.
Numbers are not indexed.
The StopWords array is a configurable list of stopwords. These are meant to be stopwords in addition to the set of stopwords in CTXSYS.DEFAULT_STOPLIST.
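The stoplist behavior described above can be approximated by hand with CTX_DDL. This is a hedged sketch, not what the procedure executes: the stoplist name MY_XDB_STOPLIST and the sample stopwords are hypothetical, and unlike the package's stoplist, this one starts empty rather than from the words in CTXSYS.DEFAULT_STOPLIST.

```sql
BEGIN
  -- MY_XDB_STOPLIST is a hypothetical name; the new stoplist starts empty.
  CTX_DDL.CREATE_STOPLIST('MY_XDB_STOPLIST', 'BASIC_STOPLIST');

  -- Exclude numbers from the index.
  CTX_DDL.ADD_STOPCLASS('MY_XDB_STOPLIST', 'NUMBERS');

  -- Site-specific stopwords (sample values), playing the role of StopWords.
  CTX_DDL.ADD_STOPWORD('MY_XDB_STOPLIST', 'draft');
  CTX_DDL.ADD_STOPWORD('MY_XDB_STOPLIST', 'confidential');
END;
/
```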
CREATESTORAGEPREF Procedure
This procedure creates a BASIC_STORAGE preference for the CONTEXT index on the XML DB hierarchy.
Syntax
DBMS_XDBT.CREATESTORAGEPREF;
Usage Notes
The name of the storage preference can be modified; see the StoragePref configuration setting.
A tablespace can be specified for the tables and indexes comprising the CONTEXT index; see the IndexTablespace configuration setting.
Prefix and Substring indexing are not turned on by default.
The I_INDEX_CLAUSE uses key compression.
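A comparable storage preference written by hand with CTX_DDL might look like the following. It is a sketch rather than the procedure's implementation; the preference name MY_XDB_STORAGE and the tablespace MY_TBS are hypothetical, and the exact clauses the package generates are not shown in this chapter.

```sql
BEGIN
  -- MY_XDB_STORAGE and MY_TBS are hypothetical names.
  CTX_DDL.CREATE_PREFERENCE('MY_XDB_STORAGE', 'BASIC_STORAGE');

  -- Place the token table in the chosen tablespace.
  CTX_DDL.SET_ATTRIBUTE('MY_XDB_STORAGE', 'I_TABLE_CLAUSE',
                        'tablespace MY_TBS');

  -- Place the token index in the same tablespace, with key compression.
  CTX_DDL.SET_ATTRIBUTE('MY_XDB_STORAGE', 'I_INDEX_CLAUSE',
                        'tablespace MY_TBS compress 2');
END;
/
```

The remaining BASIC_STORAGE table clauses could be set the same way if every internal table should live in one tablespace.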
CREATEWORDLISTPREF Procedure
This procedure creates a word list preference for the CONTEXT index on the XML DB hierarchy.
Syntax
DBMS_XDBT.CREATEWORDLISTPREF;
Usage Notes
The name of the word list preference can be modified; see the WordlistPref configuration setting. No other configuration settings are provided.
The FUZZY_MATCH and STEMMER attributes are set to AUTO (auto-language detection).
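An equivalent word list preference could be set up manually through CTX_DDL, as sketched below. The preference name MY_XDB_WORDLIST is hypothetical, and the block illustrates the attributes named above rather than reproducing the procedure's code.

```sql
BEGIN
  -- MY_XDB_WORDLIST is a hypothetical preference name.
  CTX_DDL.CREATE_PREFERENCE('MY_XDB_WORDLIST', 'BASIC_WORDLIST');

  -- Let Oracle Text choose fuzzy matching and stemming rules
  -- by detecting the language automatically.
  CTX_DDL.SET_ATTRIBUTE('MY_XDB_WORDLIST', 'FUZZY_MATCH', 'AUTO');
  CTX_DDL.SET_ATTRIBUTE('MY_XDB_WORDLIST', 'STEMMER', 'AUTO');
END;
/
```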
DROPPREFERENCES Procedure
This procedure drops any previously created preferences for the CONTEXT index on the XML DB hierarchy.
Syntax
DBMS_XDBT.DROPPREFERENCES;