Cyclistic Bike Share Analysis

Du Pham

Goal: Provide graphs about how casual riders and annual members use Cyclistic bikes differently so that the team can come up with appropriate stategies to maximize the number of annual memberships.

  1. Data Prep: we collect the data files for the year 2019 from the webpage "https://divvy-tripdata.s3.amazonaws.com/index.html" to avoid the bias in data of 2020 due to the COVID pandemic. We combine the data files in one data frame.
  2. Data Analysis:
  1. Conclusion:
  2. Further considerations:

1. Data Preparation:

Read the data .csv files

See the data type and the first few row of data.

We want to find if there is any COLUMNS with null values

The missing data are in columns gender and birthyear which do not carry important numeric information for the analysis. We decide to leave the data as is.

2. Analysis:

We observe that the mean and median usage of the casual riders are 3,665 and 1,283, which are much higher than those of annual members, 749 and 528 resp.


We now partition the users into groups of specific time windows and carry the plots to compare the numbers of users between the casual and annual riders.

Here is a plot when the scales of each group are different to see clearer the groups d and f

3. Conclusion

4. Further Consideration