Skip to content

Monicajhe/Wrangle-and-Analyze-Data

Repository files navigation

Udacity-Data-Analyst-Nanodegree-Project-4-Wrangle-and-Analyze-Data

The tasks in this project are as follows:

  1. Data wrangling, which consists of:
  • Gathering data (downloadable file in the Resources tab in the left most panel of your classroom and linked in step 1 below).
  • Assessing data
  • Cleaning data
  1. Storing, analyzing, and visualizing your wrangled data
  2. Reporting on 1) your data wrangling efforts and 2) your data analyses and visualizations

Real-world data rarely comes clean. Using Python and its libraries, you will gather data from a variety of sources and in a variety of formats, assess its quality and tidiness, then clean it. This is called data wrangling. You will document your wrangling efforts in a Jupyter Notebook, plus showcase them through analyses and visualizations using Python (and its libraries) and/or SQL.

Dataset:

The dataset that you will be wrangling (and analyzing and visualizing) is the tweet archive of Twitter user @dog_rates, also known as WeRateDogs. WeRateDogs is a Twitter account that rates people's dogs with a humorous comment about the dog. These ratings almost always have a denominator of 10. The numerators, though? Almost always greater than 10. 11/10, 12/10, 13/10, etc. Why? Because "they're good dogs Brent." WeRateDogs has over 4 million followers and has received international media coverage.

About

Udacity Data Analyst Nanodegree Project 4

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published