Nutch Shorterm Goals

  1. Ability to use regular expressions for URL substitutions.
  2. Allow users to to search using url:Store/View/Product/1001
  3. Faster crawling of websites that look like one (1) IP address.
  4. Some sort of templating engine for creating search results pages. Maybe use Velocity?

2 Comments »

  1. Sam said,

    September 9, 2004 @ 11:11 pm

    sweet!

  2. Brian said,

    September 10, 2004 @ 9:07 am

    Very cool Luke! We’ll end up using Nutch yet. ;-)

RSS feed for comments on this post · TrackBack URI

Leave a Comment