any of the keywords of this phrase that were in the EXCLUDE_SEARCH_TERMS
list will not be recorded.
An example:
A visitor arrives from a Google after searching for "sprockets gizmos
gadgets". Your config file contains EXCLUDE_SEARCH_TERMS: 'gizmos'.
After running parse.py this Google search will not contain a search
phrase (since it will be excluded) because it contains the
term, "gadgets".
Furthermore, this will contain only 2 keywords ("gizmos" and
"sprockets") since "gadgets" will be excluded.
This parameter should be specified as a comma-separated list of strings
if there is more than one term that you wish to exlucde.
Example 1: excluding a single term
EXCLUDE_SEARCH_TERMS: gizmos
Example 2: excluding multiple terms
EXCLUDE_SEARCH_TERMS: gizmos, gadget, foo bar, sprocket
KNOWN_PAGES
A hit is considered any file that is accessed. A page is considered to be
any page aka document that is accessed. The difference is
subtle but useful (since it may be desirable to view stats of image files
in the same manner that you view html files-- since an html file can
contain dozens of images).
You can specify which files you consider to be pages by listing their
suffixes in a comma separated list: doc, html
Additionally, if you change these entries after you have parsed some log
files you will want to run the update_known_pages.py script that is in the
Scratchy/scripts directory. This script updates the appropriate database
tables such that reports produced with this data will reflect the
currently recognized KNOWN_PAGES.