Census of Manufactures - Mobile Header

Industry Classification

This step in the pipeline standardizes the industry classifications for all establishments across the Census of Manufactures from 1850 to 1880. Originally, industry information was recorded in inconsistent and often ambiguous text entries (e.g., “mill,” “cabinet,” or “bootmaker”), which made analysis across years and locations difficult. This step addresses that by cleaning the original text strings, matching them to standardized industry categories using crosswalks, and assigning each firm a set of consistent industry labels. These include both detailed and broader classifications that allow users to group establishments according to the level of industry specificity needed for their analysis.

The process begins by cleaning raw industry labels and matching them to cleaned values previously compiled by research assistants. From there, entries are either automatically classified using keyword-based logic (especially for common industries like flour and lumber mills) or matched against manually reviewed crosswalks. For the 1880 special schedules, industry assignments are often hardcoded based on the known content of each form (e.g., all firms in SS2 are boot and shoe manufacturers). In the final step, each firm is assigned hierarchical industry codes ranging from specific (“flour and grist mills”) to broad (“food manufacturing”), making the data ready for consistent, high-quality historical analysis.

code-step1

code-step2

code-support1

code-support2