Crowdsourced-Data Normalization with Python and Pandas

Pandas is a popular and powerful package used in Python communities for data handling and analysis. This lesson describes crowdsourcing as a form of data creation as well as how pandas can be used to prepare a crowdsourced dataset for analysis. This lesson covers managing duplicate and missing data...
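
As a rough illustration of the duplicate and missing-data handling the abstract mentions, the sketch below uses standard pandas calls (drop_duplicates, isna, dropna). The records and column names are invented for illustration and are not taken from the lesson's dataset.

```python
import numpy as np
import pandas as pd

# Hypothetical crowdsourced records: names and values are assumptions,
# not data from the lesson.
df = pd.DataFrame({
    "contributor": ["ana", "ben", "ben", "cy"],
    "dish": ["soup", "pie", "pie", None],
    "price": [0.25, 0.10, 0.10, np.nan],
})

# Remove rows that exactly repeat an earlier row.
deduped = df.drop_duplicates()

# Count missing values in each column before deciding how to treat them.
missing_per_column = deduped.isna().sum()

# One possible policy: keep only rows with no missing values.
complete = deduped.dropna()

print(missing_per_column)
print(complete)
```

Dropping incomplete rows is only one option; depending on the research question, missing values might instead be kept or filled in.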

Bibliographic Details
Main Author: Halle Burns
Format: Article
Language: English
Published: Editorial Board of the Programming Historian 2021-05-01
Series: The Programming Historian
Online Access: https://programminghistorian.org/en/lessons/crowdsourced-data-normalization-with-pandas