Package edu.splat.wikicat.xmlparsers

Class Summary
CategoryExtractor Extracts categories from Wikipedia by analyzing Category articles and then outputing "cat1\tcat2" where cat1 is a parent of cat2.
ExtractArticlesWithCategory Given a file containing categories with one category per line, extracts articles with at least one of the categories.