This is an e-commerce product extraction and grouping system. It runs on Hadoop over AWS and includes AWS scripts that help you manage the system.