TWINSSE

Parameter Configuration File

This file contains the parameter configuration file details that are read and used by the executables generated in the subprojects. A single configuration file contains the parameters for a single database configuration (including both plain and the corresponding meta-database).

Configuration File Format

<plain_database_file_path>
<number_of_threads_to_use_with_plain_database>
<number_of_keywords_in_the_plain_database>
<number_of_maximum_document_identifiers_for_a_keyword>
<Bloom_filter_size_as_a_power_of_2_value>
<number_of_bits_required_to_address_the_Bloom_filter>
<meta_database_file_path>
<number_of_threads_to_use_with_meta_database>
<number_of_metakeywords_in_the_meta_database>
<number_of_maximum_document_identifiers_for_a_metakeyword>
<Bloom_filter_size_as_a_power_of_2_value_for_the_meta-database>
<number_of_bits_required_to_address_the_Bloom_filter_for_the_meta-database>

The configuration file in this directory for the database supplied is shown here.

../databases/db6k.dat
24
6043
1809
131072
17
../databases/meta_db6k.dat
24
32621
1170
2097152
21

Detailed Specification

Each line contains a parameter value/string. Do not put whitespace. Try to avoid whitespaces in filepaths.

Line 1: (string) Path to plain database file containing the inverted index of keyword and documents identifiers

Line 2: (integer) Number of threads to use while processing (or executing any setup or search) over the plain database. The conjunctive and the DNF experiments use this value.

Line 3: (integer) Number of unique keywords in the plain database. This is necessary for allocating memory.

Line 4: (integer) The maximum number of document identifiers for a keyword in the plain database. This is necessary for allocating memory.

Line 5: (integer) The size of the Bloom filter (number of addresses/elements) for the plain database. The conjunctive and the DNF experiments use this value. This value is a power of 2, just larger than the total number of unique keyword-id pairs in the plain database.

Line 6: (integer) Number of bits required to address the above Bloom filter. For example, if the Bloom filter size is $2^n$, then number of bits required to address is $n$.

Line 7: (string) Path to the metakeyword database file containing the metakeywords and the associated document identifiers.

Line 8: (integer) Number of threads to use while processing (while executing setup and search) metakeyword database. The disjunctive and the CNF experiments use this value.

Line 9: (integer) Number of metakeywords in the metakeyword database. This is necessary for allocating memory.

Line 10: (integer) The maximum number of document identifiers for a metakeyword in the metakeyword database. This is necessary for allocating memory.

Line 11: (integer) The size of the Bloom filter (number of addresses/elements) for the metakeyword database. The disjunctive and the CNF experiments use this value. This value is a power of 2, just larger than the total number of unique metakeyword-id pairs in the metakeyword database.

Line 12: (integer) Number of bits required to address the above Bloom filter. For example, if the Bloom filter size is $2^n$, then number of bits required to address is $n$.