To understand usage patterns such as trip origin, distance, frequency, day, and time of day, I gathered data from the City of Calgary’s Open Data Portal. Electric scooters, or e-scooters, are a convenient and sustainable way of getting around the city where I live. I made …
Calgary’s Shared Micromobility (e-Bikes and e-Scooters) Pilot Trips Data Dashboard
Jupyter and Pandas Project: Cleaning Data Scientist Jobs Dataset
This project covers the following data-cleaning tasks in Jupyter using the pandas library: get cell info; clean Salary Estimate (remove the range, compute the average of the maximum and minimum values, and save it to another column, ‘Salary_Average_Clean’); clean Job Description; clean Rating; clean Company Name …
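As a rough illustration of the Salary Estimate step, here is a minimal sketch of turning a salary range into its average. The function name `salary_average` and the sample string format are assumptions made for this example, not the code from the post, and the sketch uses only the standard library so it runs without pandas.

```python
import re


def salary_average(estimate):
    """Return the midpoint of a salary range string, or None if no range is found."""
    # Collect every run of digits; the input format here is an assumed example.
    numbers = [int(n) for n in re.findall(r"\d+", estimate)]
    if len(numbers) < 2:
        return None  # no minimum/maximum pair to average
    low, high = numbers[0], numbers[1]
    return (low + high) / 2


print(salary_average("$137K-$171K (Glassdoor est.)"))  # 154.0
```

In a notebook, a function like this could be applied with something like `df["Salary Estimate"].apply(salary_average)` and the result stored in ‘Salary_Average_Clean’, assuming the source column is named `Salary Estimate`.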
Jupyter and Pandas Project: Working on Data Scientist Jobs Dataset
This study works through some real-world questions that can be answered with Jupyter Notebook, pandas, and matplotlib, using this dataset: https://www.kaggle.com/datasets/andrewmvd/data-scientist-jobs. Below are the results. The code is in my GitHub repo. Which top 15 companies are hiring the most data scientists? What are the …
Jupyter with Pandas using Large Datasets for Sorting, Filtering, Slicing, and Looping
To give you a brief overview of sorting, filtering, slicing, and looping in Jupyter with Pandas, I will start with some definitions and then move on to simple code that performs those tasks. Jupyter is an open-source web application used to create and share live code and visualizations. Pandas, on …
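The four operations named above can be sketched on a tiny DataFrame. The column names and values below are made up for illustration and are not taken from the post's dataset.

```python
import pandas as pd

# Hypothetical sample data, small enough to read at a glance.
df = pd.DataFrame({
    "name": ["a", "b", "c", "d"],
    "score": [120, 340, 210, 90],
})

# Sort: order rows by "score", largest first.
sorted_df = df.sort_values("score", ascending=False)

# Filter: keep only rows whose score is above 100.
high = df[df["score"] > 100]

# Slice: take the first two rows by position.
first_two = df.iloc[:2]

# Loop: iterate over rows as named tuples.
for row in df.itertuples(index=False):
    print(row.name, row.score)
```

On large datasets, vectorized operations such as the sort and filter above are usually much faster than looping row by row, so the loop is best kept for cases that cannot be expressed otherwise.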
Jupyter with Numpy – Random Walks Simulation
This article includes: What is a Random Walk Simulation? Suppose you have a coin. Each time you flip it and get heads, you walk forward. If you get tails, you walk backward. You do this several times and end up tracing many different patterns on the ground. The final location …
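The coin-flip walk described above can be sketched in a few lines. This version uses Python's standard `random` module rather than NumPy, and the function name `random_walk` and the `seed` parameter are assumptions made for this example.

```python
import random


def random_walk(steps, seed=None):
    """Simulate a one-dimensional walk: heads is +1 step, tails is -1 step."""
    rng = random.Random(seed)  # a seed makes the walk reproducible
    position = 0
    path = [position]
    for _ in range(steps):
        step = 1 if rng.random() < 0.5 else -1  # a fair coin flip
        position += step
        path.append(position)
    return path


path = random_walk(10, seed=42)
print("final location:", path[-1])
```

The returned list holds the position after every flip, so its last element is the final location and the whole list can be plotted to see the pattern of the walk.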
Numpy Sorting
Numpy is a Python library, and one of its features is sorting, which organizes data into a particular order. Numpy sorting is useful for managing large amounts of data. It is also a necessary step in many calculations and analyses. To use this feature, you need to import the Numpy …
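A minimal sketch of sorting with Numpy follows. The array values are made up for illustration, and the post itself may use different functions.

```python
import numpy as np

values = np.array([42, 7, 19, 3])

# np.sort returns a sorted copy and leaves the original array unchanged.
ascending = np.sort(values)

# Reversing the sorted copy gives descending order.
descending = np.sort(values)[::-1]

# np.argsort returns the indices that would sort the array.
order = np.argsort(values)

print(ascending)  # [ 3  7 19 42]
print(order)      # [3 1 2 0]
```

`np.argsort` is handy when one array's order has to be applied to another, for example sorting names by their matching scores.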
Numpy Basics with matplotLib and mpl_toolkits
Matplotlib is a Python library used to create visualizations such as line charts, bar graphs, scatter plots, histograms, and more. If you are using PyCharm, get started with Matplotlib by opening the Terminal or pressing ALT+F12 on the keyboard. Install Matplotlib using pip as shown: …
CRUD Operations in Python
CRUD stands for Create, Read, Update, and Delete. It is used in database applications to manage data and its persistent storage. CREATE refers to the methods of adding new data to the database. Adding data to a database uses the SQL INSERT statement. READ refers to the methods of retrieving existing data …
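The four operations can be sketched with Python's built-in `sqlite3` module and an in-memory database. The `items` table and its columns are hypothetical, chosen only for this example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # temporary database, nothing written to disk
conn.execute("CREATE TABLE items (id INTEGER PRIMARY KEY, name TEXT, qty INTEGER)")

# CREATE: add a new row with INSERT.
conn.execute("INSERT INTO items (name, qty) VALUES (?, ?)", ("widget", 5))

# READ: retrieve existing rows with SELECT.
rows = conn.execute("SELECT name, qty FROM items").fetchall()

# UPDATE: change an existing row.
conn.execute("UPDATE items SET qty = ? WHERE name = ?", (8, "widget"))
updated = conn.execute("SELECT qty FROM items WHERE name = ?", ("widget",)).fetchone()

# DELETE: remove the row.
conn.execute("DELETE FROM items WHERE name = ?", ("widget",))
remaining = conn.execute("SELECT COUNT(*) FROM items").fetchone()[0]

conn.close()
print(rows, updated, remaining)  # [('widget', 5)] (8,) 0
```

The `?` placeholders pass values separately from the SQL text, which avoids building statements by string concatenation.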
Python’s Pickling and Unpickling
Pickling is Python’s way of serializing objects. Serializing an object means converting it to a byte stream, which can be saved to a file and retrieved later. Below is a screenshot of my solution for Chapter 9’s exercise number 8 from Starting Out with Python, 5th Edition by …
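Here is a minimal sketch of a pickling and unpickling round trip using the standard `pickle` module. The dictionary and the file name `record.pkl` are made up for this example and are not the book exercise's solution.

```python
import os
import pickle
import tempfile

record = {"name": "Ada", "scores": [90, 85]}  # hypothetical sample object

# Write into a fresh temporary directory to avoid touching existing files.
path = os.path.join(tempfile.mkdtemp(), "record.pkl")

# Pickling: serialize the object to bytes and save it (note binary mode "wb").
with open(path, "wb") as f:
    pickle.dump(record, f)

# Unpickling: read the bytes back and rebuild an equal object.
with open(path, "rb") as f:
    restored = pickle.load(f)

print(restored == record)  # True
```

Only unpickle files from sources you trust, since loading a pickle can execute arbitrary code.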
Matplotlib Exercise: Most Frequent Characters in a Word
This exercise is from the book Starting Out with Python, 5th Edition by Tony Gaddis. The original solution does not require matplotlib, but I added the module to chart the characters and the number of times each appears in the word entered by a user. All data are dynamically generated based …
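The counting step behind this exercise can be sketched with `collections.Counter`. The function name `most_frequent_characters` is an assumption for this example, and the sketch leaves out the charting part.

```python
from collections import Counter


def most_frequent_characters(word):
    """Return (characters, count) for the most frequent character(s) in word."""
    counts = Counter(word.lower())  # treat upper and lower case as the same
    if not counts:
        return [], 0  # empty input has no characters to count
    top = max(counts.values())
    # Several characters can tie for the highest count, so return all of them.
    winners = sorted(ch for ch, n in counts.items() if n == top)
    return winners, top


print(most_frequent_characters("banana"))  # (['a'], 3)
```

To chart the result, the characters and counts from the `Counter` could be passed to matplotlib's `plt.bar` as the x and y values.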