A Big data project using Pig, Hive and Map-Reduce scripts to analyse the Stack Exchange data.
-
Updated
Aug 15, 2019 - Python
A Big data project using Pig, Hive and Map-Reduce scripts to analyse the Stack Exchange data.
A repository for showcasing my knowledge of the Apache Pig/Piglatin programming language, and continuing to learn the language.
This repository contains the H1B_Visa Applicants Data Analysis project/case study using Hadoop undertaken during the training at NIIT. MapReduce,Hive,Pig,Scoop and Shell-scripting are the technologies used.
Apache Hadoop Components Installation Guide on Windows
Big Data and AI Engineering bootcamp 2nd capstone project. Using Big Data Tools to predict the probability of university enrollment for Egypt's High School students. 🏫 📚 🔬
Apache Pig Latin Script to Convert EPrints XML to Graph GML files and geocoded CSV files
Big Data – Apache server logs analysis using Pig and Python
Apache Hadoop – A course for undergraduates | along with Apache Pig and Hive
A project that demonstrates data storage, preprocessing, and analysis using tools like HDFS, Apache Pig, and Hive, executed in an Azure virtual machine environment. The project includes cleaning and aggregating a Spotify dataset and running Hive queries to extract meaningful insights.
Collection of Docker Images Commonly Used in Data-Intensive Applications
Big data training material
This repository contains Apache Spark, Apache Hive, Apache Pig work
Add a description, image, and links to the apache-pig topic page so that developers can more easily learn about it.
To associate your repository with the apache-pig topic, visit your repo's landing page and select "manage topics."