After running parse.py this Google search will not contain a search
phrase (since it will be excluded) because it contains the
term, "gadgets".
Furthermore, this will contain only 2 keywords ("gizmos" and
"sprockets") since "gadgets" will be excluded.
This parameter should be specified as a comma-separated list of strings
if there is more than one term that you wish to exlucde.
KNOWN_PAGES
A hit is considered any file that is accessed. A page is considered to be
any page aka document that is accessed. The difference is
subtle but useful (since it may be desirable to view stats of image files
in the same manner that you view html files-- since an html file can
contain dozens of images).
You can specify which files you consider to be pages by listing their
suffixes in a comma separated list: doc, html
Additionally, if you change these entries after you have parsed some log
files you will want to run the update_known_pages.py script that is in the
Scratchy/scripts directory. This script updates the appropriate database
tables such that reports produced with this data will reflect the
currently recognized KNOWN_PAGES.