This directory contains the sample database files for the experiments. The plain databases files (parsed indexes) are supplied, and the meta-databases are generated from utility codes present in the database subdirectory of respective projects or codes in the precision subdirectory.
Details of the databases.
db6k.dat Number of keywords: 6043, Number of keyword-id pairs: 80901, Number of maximum ids per kw: 1809, Number of unique ids: 9690
db22k.dat Number of keywords: 22087, Number of keyword-id pairs: 407753, Number of maximum ids per kw: 3668, Number of unique ids: 25001
db133k.dat Number of keywords: 133958, Number of keyword-id pairs: 8212090, Number of maximum ids per kw: 75478, Number of unique ids: 509719
(Due to size restriction on Github, only the db6k.dat and meta_db6k.dat are supplied in the databases directory)
Additional databases files are available on Google Drive, which can be parsed and used in the same way as of these files.