
Research Article
Collection and Integration of Patent Data for Analysis and Validation of Brazilian Technical Production
@INPROCEEDINGS{10.1007/978-3-031-22324-2_4, author={Raulivan Rodrigo da Silva and Thiago Magela Rodrigues Dias and Washington Lu\^{\i}s Ribeiro de Carvalho Segundo}, title={Collection and Integration of Patent Data for Analysis and Validation of Brazilian Technical Production}, proceedings={Data and Information in Online Environments. Third EAI International Conference, DIONE 2022, Virtual Event, July 28-29, 2022, Proceedings}, proceedings_a={DIONE}, year={2022}, month={12}, keywords={Patents INPI Lattes platform Espacenet Patentometry}, doi={10.1007/978-3-031-22324-2_4} }
- Raulivan Rodrigo da Silva
Thiago Magela Rodrigues Dias
Washington Luís Ribeiro de Carvalho Segundo
Year: 2022
Collection and Integration of Patent Data for Analysis and Validation of Brazilian Technical Production
DIONE
Springer
DOI: 10.1007/978-3-031-22324-2_4
Abstract
The main objective of this work is to present an overview of Brazilian technical production from the analysis of patents registered in national repositories such as the National Institute of Industrial Property (INPI), as well as in international repositories such as Espacenet. Initially, all the INPI patent identifiers are collected so that later, all the data of these patents are collected in the Espacenet repository. In this way, a large local data repository is characterized, standing out for having consistent and structured data. This strategy makes it possible to analyze the entire dataset with the adoption of several metrics, making it possible to present a broad and unprecedented portrait of Brazilian technical production. With the dataset retrieved, it is still possible to validate other repositories or datasets, such as the curriculum base of the Lattes Platform, in which their curricula in general do not have all the information about the patents duly registered. However, it is concluded that the collection and integration of data extracted from patent documents and stored in local repositories, significantly enable patentometric analyzes that could not be performed in distributed repositories.