Calling all pedes who are familiar with Python to help process the list of CCP members. I couldn't find an English translation of the data, so I put together a quick script this morning.
If a translated English copy already exists, that would be awesome, but I haven't seen one.
The code hits Google Translate (i.e., translate.google.com), but I'm only able to process ~6 rows/second, or roughly 22,000 rows per hour. Google appears to be throttling requests by IP address (I already tried multi-threading, and it didn't help).
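For a rough sense of the numbers, here's a quick back-of-the-envelope sketch. It assumes every helper runs from a separate IP and sustains the same ~6 rows/second; the function names and the row count in the usage note are placeholders for illustration, not figures from the data.

```python
SECONDS_PER_HOUR = 3600


def rows_per_hour(rows_per_second):
    """Convert a per-second translation rate into rows per hour."""
    return rows_per_second * SECONDS_PER_HOUR


def hours_needed(total_rows, rows_per_second, workers=1):
    """Estimate hours to finish, assuming each worker (separate IP)
    sustains the same rate and the work is split evenly."""
    return total_rows / (rows_per_hour(rows_per_second) * workers)


print(rows_per_hour(6))  # 21600, i.e. roughly 22,000 rows per hour
```

For example, `hours_needed(1_000_000, 6, workers=10)` gives the estimate for a hypothetical one-million-row file split across ten people.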
I'd like to split up the file into chunks so we can tackle this as a group.
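Here's a minimal sketch of how the splitting could work. It isn't what's in the repo: it assumes the data is a UTF-8 CSV with a header row, and `split_csv`, the `chunk_0001.csv` naming, and the default chunk size (about one hour of work at the rate above) are all my own placeholders.

```python
import csv
from itertools import islice
from pathlib import Path


def split_csv(src, out_dir, rows_per_chunk=22_000, has_header=True,
              encoding="utf-8"):
    """Split src into numbered CSV files of at most rows_per_chunk data rows.

    The header row (if any) is repeated at the top of every chunk so each
    file can be processed on its own. Returns the list of paths written.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    with open(src, newline="", encoding=encoding) as f:
        reader = csv.reader(f)
        header = next(reader, None) if has_header else None
        index = 0
        while True:
            # Read the next batch lazily so the whole file never sits in memory.
            rows = list(islice(reader, rows_per_chunk))
            if not rows:
                break
            index += 1
            path = out_dir / f"chunk_{index:04d}.csv"
            with open(path, "w", newline="", encoding=encoding) as out:
                writer = csv.writer(out)
                if header is not None:
                    writer.writerow(header)
                writer.writerows(rows)
            written.append(path)
    return written
```

Usage would look something like `split_csv("members.csv", "chunks")`, where `members.csv` stands in for whatever the source file is actually called. Each person can then claim a chunk number and post back the translated file.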
Any help or recommendations would be appreciated. Let's get this data out there so more people can see and work with it.
You can access the repository on GitHub: https://github.com/StopTheCCP/CCP-Database-Leak
https://thedonald.win/p/11R4SXt53J/lets-translate-the-195-million-c/
Thank you.