This project includes the following data-cleaning tasks in Jupyter using the pandas library:
# Get cell info # Clean Salary Estimate. # Remove range. Get average of maximum and minimum values and save it to # another column 'Salary_Average_Clean' # Clean Job Description # Clean Rating # Clean Company Name # Clean Size. Make 2 additional columns for min and max number of employees # Clean Founded, Industry, Sector, Competitors by replacing -1 with 'Unknown' # Clean Easy Apply. Replace -1 with False
You may find my code in my GitHub repo here.