Type in the query words in a column. When certain words designate something
what you mean only when being used together (e.g. "US" and "economy"),
put them in one line separated by spaces.
One can put up to 5 words into a line. There can be up to 30 lines.
The required number of query lines depends on the processing mode (usually
not less than 3).
Important: the system considers the order of lines as corresponding
to their relative value in user's query.
It is also possible to reflect your theme by a query more perfectly.
For instructions use Help option in the pop-up menu or the Query syntax
below.
After starting the program, this window will display new queries, automatically
created by the system in addition to your initial query.
Query syntax:
Each query line can be preceded by indication of its presence being obligatory:
# | - obligatory in every paragraph selected |
% | - obligatory for the starting group of paragraphs |
+ | - obligatory present in the first paragraph selected |
Query words which you consider having the same meaning should be separated
with "&".
Words which are to be found together and in the same order should be
separated with "_".
Project's Manager lets to easily change between existing projects and
corresponding queries, so as to create the new ones, rename or delete queries.
Thematic project uses a separate textual database, to which an unlimited
number of queries can be addressed. Under each query name you save query
lines, processing mode, language of data processing and results.
After each start of the program the Default project is created, accompanied
with a query Untitled.
Tuning of the system. Shifting to the right will make the system follow
the more strict requirements for including and arranging of the information
items in the output material. This increases the degree of relevancy but
results in less amount of the output information. Shifting to the left
will weaken requirements for data processing. This is worth choosing when
the database contains little information relevant to your query.
Automatic decrease of requirements for data processing. Use it when
expecting that data processing will take too long time. In this mode the
system will test the database content: if there is insufficient information
relevant to the query, the program will gradually decrease the requirements
for selecting and arranging of the information items.
Indicate the suitable data processing time. (Note, that this is a time
only for working with your query, and it doesn't count the time needed
for certain data preprocessing in case when database is analyzed for the
first time).
A set of data data processing parameters, optimized for individual needs;
can be selected here when available.
Language of data processing. Selection automatically activates corresponding
linguistic facilities, including the Stop words list and parsing.
This is a nonobligatory used list of words (terms) occurring at
high frequencies within a given thematic field. You can use it in order
to augment relevancy and informativeness of the output material. Type in
the terms in a column.
Indicate, whether to use context in current data processing.
Indicate, whether the context terms should be included into newly created
queries. Presence of common terms in queries generally results in increasing
amount of the output material accompanied by decrease of its informativeness.
Choose a directory containing data files to be processed.
By default it is a $DATBASE in the directory of a current project.
Stop:
List of commonly used, auxiliary and syntactic words of a given language,
which the system should not consider as meaningful.
Thematic:
List of domain-specific concepts (thesaurus), which can be used for
indexing of the processed texts, eliminating the necessity for the stop
words list. This option is not available in current version.
An editable list of stop words which are ignored when working with every
project.
Edit current project's vocabulary
An editable list of stop words which are ignored when working with current
project.
List of domain-specific concepts (thesaurus), which can be used for
indexing of the processed texts, eliminating the necessity for the stop
words list. This option is not available in current version.
Start a data retrieval module, which automatically addresses the Internet
search engines.
Not available in current version of the program.
Start the data processing. Be sure to press this button after all other
options are properly selected.
Pause the data processing. This lets to edit the current query, created
by the system and displayed in the Query window. In order to resume processing
press this button repeatedly (as it changes its look to Continue).
Finish the data processing.
Clear the results file before processing a new query.
Opening of the external browser for viewing the results.
List of created thematic clusters. Select any in the List to view the
corresponding query in the Query window. Simultaneously, a Key Words window
will display words which helped to select material in this cluster, and
in the Suggested words frame will be shown additionally revealed words
characteristic for this content.
Processing messaging window. Here are listed in particular:
- the names of screened files;
- error messages.
In course of the working process here will be displayed:
- message "Step", followed by the numbers indicating accordingly:
the number of the query being processed, total amount of processing time
(the database preprocessing period not counted), remaining processing time
(regarding the time limit preset by the user);
- message "Read", followed by the numbers indicating accordingly:total
amount of analyzed files and the number of screened paragraphs.
- message "Found", followed by the numbers indicating accordingly:
total amount of selected paragraphs (included in the output material),
number of words, suggested for including into new query.