Explore projects
-
CA675 assignment two, creating full stack application leveraging data engineering tools and approach
Updated -
CA675 Cloud assignment to perform spam detection with TF IDF using mapreduce Author : Nirav Patel (23265654)
Updated -
Updated
-
Updated
-
The aim of the assignment is to download and perform analysis on Stack Exchange data. It contains technology-related queries and answers. The dataset has 2,00,000 records and various Big Data Technologies such as Hadoop, Pig, Hive, Map-reduce.
Updated -
Analytics code from Insight team to generate periodicity graph for ARC trails.
Updated -
BERT is a neural network language model architecture introduced by Google in 2018 (Devlin et al. 2018). When training a BERT model, the network is trained not to predict the next token in a sequence but to predict a masked token as in a cloze test.
Updated -
Updated
-
Updated
-
All source code and related materials for Assignment 3 in CA4022 - Data at Speed and Scale.
Updated -
-
Sentiment analysis system improvements (using stop word removal and named entity recognition) & spam classification using BERT.
Updated -
Updated
-
Updated