Hello, dear friend, you can consult us at any time if you have any questions, add WeChat: daixieit

FIT5196 - Data wrangling

Overview

This unit introduces tools and techniques for data wrangling. It will cover the problems that prevent raw data from being effectively used in analysis and the data cleansing and pre-processing tasks that prepare it for analytics. These include, for example, the handling of bad and missing data, data integration and initial feature selection. It will also introduce text mining and web analytics. Python and the Pandas environment will be used for implementation.

Offerings

 S1-01-CLAYTON-FLEXIBLE

Location:    Clayton

Teaching period:    First semester

Attendance mode:    A combination of on-campus and online teaching (FLEXIBLE)

 S2-01-CLAYTON-FLEXIBLE

Location:    Clayton

Teaching period:    Second semester

Attendance mode:    A combination of on-campus and online teaching (FLEXIBLE)

 S2-01-MALAYSIA-ON-CAMPUS

Location:    Malaysia

Teaching period:    Second semester

Attendance mode:    Teaching activities are on-campus (ON-CAMPUS)

 

Requisites

Prerequisite

 

FIT9133                  6 CP                 Programming foundations in python

 

 

Algorithms and programming foundations in Python

 


 


 

ITI5196

 

6 CP

 

Data wrangling

 

 

ITO5196

 

 

6 CP

 

 

Data wrangling

 

 

Rules

Enrolment Rule

Prerequisites:

For C6007 students who commenced in 2020: None

Contacts

 Chief Examiner(s)

Dr Jackie Rong

 

Email:    [email protected]

Offering(s):

   Applies to all offerings

 

Learning outcomes

 

On successful completion of this unit, you should be able to:

1.     Parse data in the required format;

2.     Assess the quality of data for problem identification;

3.     Resolve data quality issues ready for the data analysis process;

4.     Integrate data sources for data enrichment;

5.     Document the wrangling process for professional reporting;

6.    Write program scripts for data wrangling processes.

 

Teaching approach

Active learning

 

 Assessment

Assessment 1

Value %:    35


Assessment 2

Value %:    35

 

Assessment 3

Value %:    30


 Scheduled teaching activities

Applied sessions

Total hours:    24 hours

Offerings:

   Applies to all offerings


Lectures

Total hours:    24 hours

Offerings:

   Second semester, Malaysia, Teaching activities are on-campus (ON-CAMPUS)

 

Seminars

Total hours:    24 hours

Offerings:

   First semester, Clayton, A combination of on-campus and online teaching (FLEXIBLE)

   Second semester, Clayton, A combination of on-campus and online teaching (FLEXIBLE)


Workload requirements

Workload

Minimum total expected workload to achieve the learning outcomes for this unit is 144 hours per semester typically comprising a mixture of scheduled online and face to face learning activities   and independent study. Independent study may include associated reading and preparation for   scheduled teaching activities.

Learning resources


 

Technology resources

 

The Programming environments: Python 3 has been added. Python 3 -

https://www.python.org/downloads/ and Jupyter Notebook - https://jupyter.org/

https://www.anaconda.com/distribution/