US OVERVIEW
Background
XX
📚 Data source
For every patent from US1A, the earliest patent we consider, up to but excluding patent XX, we collected page images (PNG) from Espacenet and OCRed the first page with Tesseract v5.
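A minimal sketch of the OCR step, for illustration only. It assumes the Tesseract command-line tool is installed and on the `PATH`, and that English language data (`eng`) is the right choice; neither is stated above. The helper names and the image file name are hypothetical, and the actual pipeline may call Tesseract differently.

```python
import subprocess
from pathlib import Path
from typing import List


def build_ocr_command(image_path: Path, lang: str = "eng") -> List[str]:
    """Build a Tesseract CLI call that prints the recognized text to stdout.

    The language code is an assumption, not something the documentation states.
    """
    # `tesseract <image> stdout -l <lang>` writes the OCR result to standard output.
    return ["tesseract", str(image_path), "stdout", "-l", lang]


def ocr_first_page(image_path: Path, lang: str = "eng") -> str:
    """Run Tesseract on a first-page PNG and return the recognized text."""
    result = subprocess.run(
        build_ocr_command(image_path, lang),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if Tesseract exits with an error
    )
    return result.stdout


# Hypothetical usage, with a first-page image already downloaded from Espacenet:
# text = ocr_first_page(Path("US75A_page1.png"))
```

Keeping the command construction separate from the subprocess call makes it possible to check the invocation without having Tesseract installed.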
| Patent office | Time span (publication year) | Kind code(s) |
|---|---|---|
| US | 1836-1980 | A*; B1, B2** |
Notes: *: Before 2001; **: After 2001
| Publication number (range) | Data source | Pre-processing | Example | Format # |
|---|---|---|---|---|
| US1A-US1583766A | Espacenet | OCR | US75A | 1 |
| US1583767A-US1920166A | Espacenet | OCR | US1602651A | 2 |
| US1920167A-US3554066A | Espacenet | OCR | US2427801A | 3 |
| US3554067A-... | Espacenet | OCR | US3564067A | 4 |
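The range table above can be read as a lookup from publication number to format number. The following is a minimal sketch of that lookup, not the project's own code: the function names are hypothetical, and the regular expression assumes publication numbers of the form `US<digits><kind code>` as in the examples. Because the last range is open-ended in the table, format 4 is returned for every number from US3554067A onward, with no upper limit enforced.

```python
import re
from typing import Tuple

# Inclusive upper bound of each closed range in the table, paired with its format number.
FORMAT_UPPER_BOUNDS = [(1583766, 1), (1920166, 2), (3554066, 3)]
# The last range ("US3554067A-...") is open-ended in the table.
OPEN_ENDED_FORMAT = 4

# Assumed shape: "US", a serial number, then a kind code such as A, B1 or B2.
_PUBLICATION_RE = re.compile(r"^US(\d+)([A-Z]\d?)$")


def parse_publication_number(publication_number: str) -> Tuple[int, str]:
    """Split e.g. 'US75A' into its serial number (75) and kind code ('A')."""
    match = _PUBLICATION_RE.match(publication_number.strip().upper())
    if match is None:
        raise ValueError(f"Unrecognized publication number: {publication_number!r}")
    return int(match.group(1)), match.group(2)


def format_number(publication_number: str) -> int:
    """Return the format number (1-4) whose range contains the publication number."""
    serial, _kind_code = parse_publication_number(publication_number)
    if serial < 1:
        raise ValueError("The earliest patent considered is US1A")
    for upper_bound, fmt in FORMAT_UPPER_BOUNDS:
        if serial <= upper_bound:
            return fmt
    return OPEN_ENDED_FORMAT


# The example patents listed in the table fall into their own rows:
for example in ("US75A", "US1602651A", "US2427801A", "US3564067A"):
    print(example, format_number(example))
```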
🚜 Extraction schema
See the annotation guidelines.
🔮 Models
See the model card.
Other
See the geocoding, citizenship, and deduplication documentation.