Note: Please read this Confluence page, which explains the complete architecture of how RDocumentation works.
This project sets up an AWS Lambda pipeline in Node.js that runs every hour and:
- Reads all packages and their versions from CRAN, Bioconductor, and GitHub.
- If a package doesn't already exist in the S3 bucket assets.rdocumentation.org, extracts the package information and sends a job with basic information about the package to the rdocs-r-worker SQS queue.
- Jobs on the rdocs-r-worker queue are then processed by the RPackageParser service.
- The lambdas also update the JSON state files in the S3 bucket.
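The steps above can be sketched in plain Node.js. This is only an illustration under stated assumptions: the function names (`packageKey`, `planJobs`, `updateState`), the `source/name/version` key layout, and the job fields are all hypothetical and are not taken from the actual lambdas. The S3 and SQS calls are left out so that the decision logic can be read and run on its own.

```javascript
// Hypothetical sketch: key layout and job fields are assumptions,
// not the real schema used by the lambdas.

// Build the key under which a package version is assumed to be stored.
function packageKey(pkg) {
  return `${pkg.source}/${pkg.name}/${pkg.version}`;
}

// Return one job per package version that is not yet in the bucket.
// `existingKeys` is a Set of keys already present (assumed to come
// from the bucket listing or the JSON state files).
function planJobs(packages, existingKeys) {
  return packages
    .filter((pkg) => !existingKeys.has(packageKey(pkg)))
    .map((pkg) => ({
      queue: 'rdocs-r-worker',
      body: JSON.stringify({
        source: pkg.source,
        name: pkg.name,
        version: pkg.version,
      }),
    }));
}

// Return a new state Set that also contains the given package versions,
// so the next hourly run would skip them. The input Set is not mutated.
function updateState(existingKeys, packages) {
  const next = new Set(existingKeys);
  for (const pkg of packages) next.add(packageKey(pkg));
  return next;
}

// Sample data, for illustration only.
const listed = [
  { source: 'cran', name: 'dplyr', version: '1.0.0' },
  { source: 'cran', name: 'ggplot2', version: '3.3.0' },
];
const existing = new Set(['cran/dplyr/1.0.0']);

const jobs = planJobs(listed, existing);
console.log(jobs.map((job) => job.body));
```

In a real lambda, each planned job would be sent to the SQS queue and the updated state written back to the S3 bucket. Keeping those side effects outside the planning function makes the hourly check easy to test without AWS access.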
TODO: replace these instructions because apex doesn't work anymore.
Use the apex command to deploy and invoke the Lambda functions.
Examples:
apex deploy unzip
apex invoke unzip
apex metrics unzip
See the LICENSE file for license rights and limitations (MIT).