Hey everyone,
I am looking to do some clustering analysis on patents.... and of course it would be great to be able to create a document corpus that consistent of all patents year by year ( or certainly a good chunk).
Idea is to analyze all emerging areas around certain fields in genomics and bioinformatics, but I don't want to limit the analysis by including arbitrary (non ML derived ) categories...I'm looking to create the categories myself.
Does anyone know how to access/download patents in high-throughput?
Thanks so much !!
Maybe check out https://opendata.stackexchange.com/ ?
The whole reason for doing this is to analyze bioinformatics patents... I don't want to take a category view though I want to cluster all patents and find new clusters relating to bioinformatics...
If only that were mentioned somewhere in the question... I'll reopen it now.