Nutch Short-Term Goals

September 7, 2004

  1. Ability to use regular expressions for URL substitutions.
  2. Allow users to search by URL, e.g. url:Store/View/Product/1001
  3. Faster crawling of websites that resolve to one (1) IP address.
  4. Some sort of templating engine for creating search results pages. Maybe use Velocity?
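
As a rough illustration of goal 1, regular-expression URL substitution could be sketched as an ordered list of pattern/replacement rules applied to each URL in turn. This is only a sketch under assumptions: the class name UrlSubstitution, its methods, and the example rules are hypothetical and are not part of Nutch. It relies only on the standard java.util.regex package.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical helper, not a Nutch class: applies regex substitutions to URLs.
class UrlSubstitution {

    // One substitution rule: a compiled pattern and its replacement text.
    private static final class Rule {
        final Pattern pattern;
        final String replacement;

        Rule(Pattern pattern, String replacement) {
            this.pattern = pattern;
            this.replacement = replacement;
        }
    }

    // Rules are kept in insertion order so earlier rules run first.
    private final List<Rule> rules = new ArrayList<>();

    // Registers a rule; returns this so calls can be chained.
    UrlSubstitution addRule(String regex, String replacement) {
        rules.add(new Rule(Pattern.compile(regex), replacement));
        return this;
    }

    // Applies every rule in order, replacing all matches of each pattern.
    String apply(String url) {
        String result = url;
        for (Rule rule : rules) {
            result = rule.pattern.matcher(result).replaceAll(rule.replacement);
        }
        return result;
    }
}
```

With the assumed rules ";jsessionid=[^?#]*" -> "" and "^http://www\\." -> "http://", a URL that carries a session ID and a www prefix would be reduced to one canonical form before it is stored or compared. Keeping the rules in an ordered list means later rules see the output of earlier ones, which is a design choice rather than a requirement.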