-
data-engineering-exercise Public
Forked from jbaehne/data-engineering-exerciseThis repo contains an outline of the exercise we will be giving to data engineering candidates.
Python MIT License UpdatedMar 15, 2024 -
Spark-Jumble Public
This project use spark to solve jumble questions
Jupyter Notebook UpdatedAug 8, 2019 -
This project use Hive to create partitioned tables from HDFS
1 UpdatedApr 28, 2019 -
-
-
Tensorflow-Estimator-API Public
This project use Tensorflow estimator API to implement Regression and Classification problems
Jupyter Notebook UpdatedSep 19, 2018 -
This project use java & MongoDB JDBC to implement specific requirements
Java UpdatedAug 23, 2018 -
MongoDB-REST-API-design Public
This project use MongoDB and REST api to desgin a simple API to implement GET, POST, PUT and DELETE functions, use POSTMAN to test the functions
JavaScript UpdatedAug 13, 2018 -
This project use oozie & sqoop incremental job to only import the updated data(updated/inserted) into HDFS and Hive
UpdatedAug 1, 2018 -
This project use Oozie to automate the importing used by Sqoop & Hive
1 UpdatedJul 31, 2018 -
This project use Java and Data Access Object (DAO) to implemented the specific system requirements.
-
This project use Sqoop to import data from MySQL(Linux) to HDFS
UpdatedJul 26, 2018 -
Scrapy-Python Public
This project use Scrapy Framework to crawl data from websites and analyze the data with machine learning
Python UpdatedJun 21, 2018 -
This project use Pandas and probability theory to implement specific NLP problems
Python UpdatedJun 21, 2018 -
machine_learning_examples Public
Forked from lazyprogrammer/machine_learning_examplesA collection of machine learning examples and tutorials.
Python UpdatedNov 13, 2017