I checked up on this again recently (because I couldn’t quite remember why I stopped at 1980 in the original harvest!). My memory was that the XML format changed, but when I checked, I found that the raw XML files for 1981 to 1998 are just not available from ParlInfo. I don’t know why this is.
I’ve had a quick look and it seems that there is XML for 2002-3 – yay! I’ve spun up the harvesting notebook and adjusted a few things so that it should work with dates after 1980. If you have a look you’ll see there’s now a cell where you can set the END_YEAR. That’s the only thing you should need to change. I successfully harvested 2002. Let me know if you have any problems.
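In case it helps to picture the year range, here is a rough sketch of how a settings cell and the 1981 to 1998 gap might fit together. It is not the notebook's actual code: only END_YEAR is a name from the notebook, while START_YEAR, MISSING_XML_YEARS and years_to_harvest are hypothetical names made up for illustration.

```python
# Hypothetical settings cell. Only END_YEAR is a name used in the notebook;
# START_YEAR and its value are assumptions for this sketch.
START_YEAR = 1901
END_YEAR = 2002

# Years for which raw XML was not available from ParlInfo when I checked
# (1981 to 1998 inclusive).
MISSING_XML_YEARS = range(1981, 1999)


def years_to_harvest(start_year, end_year):
    """Return the years from start_year to end_year (inclusive), skipping the known gap."""
    return [
        year
        for year in range(start_year, end_year + 1)
        if year not in MISSING_XML_YEARS
    ]


if __name__ == "__main__":
    years = years_to_harvest(START_YEAR, END_YEAR)
    print(f"{len(years)} years to try, from {years[0]} to {years[-1]}")
```

Skipping the gap up front just avoids making requests that are known to come back empty; whether the real notebook does this is an assumption on my part.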
Also, it’s worth noting that Open Australia have harvested Commonwealth Hansard XML from 2006 onwards, and it’s available through their data repository.