CAMS provides a service that generates a PDF statement of all mutual fund transactions within a specified period, covering most of the fund houses. I needed a way to extract the data from the PDF so that I could compute metrics on it. Using the tabulizer package for R, I was able to extract the data. The script and instructions to run it are in the repo. Feel free to create an issue on GitHub if you face any problems while running the script.
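
Once the transactions are out of the PDF as plain rows, computing metrics on them is mostly simple aggregation. The sketch below illustrates that step only. It is written in Python rather than R, it is not the script from the repo, and the row layout (date, scheme name, amount, units), the sample values, and the `summarize` helper are all hypothetical. A real statement will likely need its own cleanup before it looks like this.

```python
from decimal import Decimal

# Hypothetical rows, as they might look after table extraction and cleanup.
# Assumed columns: date, scheme name, amount, units. Real statements differ.
ROWS = [
    ("2020-01-10", "Fund A", "5000.00", "100.000"),
    ("2020-02-10", "Fund A", "5000.00", "125.000"),
    ("2020-03-10", "Fund A", "-2000.00", "-40.000"),  # a redemption
    ("2020-01-15", "Fund B", "1000.00", "50.000"),
]


def summarize(rows):
    """Return {scheme: (net_amount, net_units)} summed over all rows."""
    totals = {}
    for _date, scheme, amount, units in rows:
        net_amount, net_units = totals.get(scheme, (Decimal("0"), Decimal("0")))
        # Decimal avoids float rounding drift on money and unit balances.
        totals[scheme] = (net_amount + Decimal(amount), net_units + Decimal(units))
    return totals


if __name__ == "__main__":
    for scheme, (net_amount, net_units) in summarize(ROWS).items():
        print(f"{scheme}: net invested {net_amount}, units held {net_units}")
```

Amounts and units are kept as strings until they are parsed into `Decimal`, which is a deliberate choice here: parsing them as floats would introduce small rounding errors in the running totals.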

Update: I have abandoned the above project. The code might still be useful to anyone who wants to do something similar.