General Requirements
- A machine powerful enough for image-related ML tasks
- Next.js (React.js)
- A backend framework that can interface with ML models
- Terraform (Infrastructure as Code Tool)
- AWS services, including but not limited to:
  - Amazon S3 (Object Storage)
  - Amazon EC2 (Virtual Machines)
  - AWS Networking Resources (VPC)
  - Possibly AWS Lambda (Serverless Functions) and Amazon DynamoDB (NoSQL DB)
  - Possibly AWS Fargate or Amazon EKS in place of EC2 VMs
Terminology
- Tile: an image with a caption. The user builds a sentence by selecting tiles in order, and the resulting sentence is then vocalized.
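
The tile definition above can be sketched as a small data type. This is only an illustration, not the project's actual model: the `Tile` field names (`imageUrl`, `caption`) and the `buildSentence` helper are hypothetical, and the vocalization step is left out because the text-to-speech mechanism is not specified here.

```typescript
// Hypothetical shape of a tile; the field names are assumptions for illustration.
interface Tile {
  imageUrl: string; // location of the tile's image (for example, an object in S3)
  caption: string;  // the word or phrase displayed with the image
}

// Join the captions of the selected tiles, in selection order, into one sentence.
// Tiles whose caption is empty or whitespace-only are skipped.
function buildSentence(tiles: Tile[]): string {
  return tiles
    .map((tile) => tile.caption.trim())
    .filter((caption) => caption.length > 0)
    .join(" ");
}

// Example selection; the URLs are placeholders.
const selected: Tile[] = [
  { imageUrl: "https://example.com/tiles/i.png", caption: "I" },
  { imageUrl: "https://example.com/tiles/want.png", caption: "want" },
  { imageUrl: "https://example.com/tiles/water.png", caption: "water" },
];

console.log(buildSentence(selected)); // prints "I want water"
```

The string returned by `buildSentence` is what would be handed to whichever vocalization step the application ends up using.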