Friday, November 13, 2009

Amazon Feeds Parser: Parsing Huge XML files

Amazon Feeds are huge and they are needed to locally search and browse their catalog as Amazon still does not allow more than 1 call per second to their PA-API. In any case even if you decide to go for the PA-API directly things like ordering a search result for a keyword in all categories will not be possible as simply that option is not available by the time of this writting.

I have opened a project which you can use to get tab delimited data from those huge feeds.

The tool I use is xgawk and I have to say that I have really enjoyed coding the *awk* way.