Skip to content

chence/DataCollectionPreProcessing

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PROG8245 Lab2 – Data Collection and Pre-processing

Project Synopsis

This is a data collection and pre-processing exercise. Our project is based on two synthetic data files. Using the widely adopted and powerful NumPy and pandas technology stack, we perform data loading, cleaning, and aggregation on these datasets.

Group Info

Group: Group1

GitHub Link: https://github.com/chence/DataCollectionPreProcessing.git

Team Members:

  • Ce Chen | 9007166
  • Zhuoran Zhang | 9048508

Quick-start

$ python -m venv .venv $ source .venv/bin/activate (.venv)$ pip install -r requirements.txt

Export PDF

(.venv)$ quarto render lab2.ipynb --to pdf --execute

Data-source attribution links

Since the customer_id field was not found in the provided download link, the data file used in this project was synthesized using ChatGPT.

List of other projects already in your GitHub (max 3 bullet points; link only).

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors