Product description: trademarks data
From: Canadian Intellectual Property Office
Trademark data is provided in two formats. The first is XML, with one XML file per trademark. The structure of the XML files is governed by the World Intellectual Property Organization (WIPO) Standard ST.96 for trademark data. The second is a DAT flat-file format with the same content as the XML files. In addition to the XML and DAT files, each trademark record includes any relevant image files in PNG and TIFF format. The XML files are bundled with the PNG files, and the DAT files are bundled with the TIFF files.
Record and data content
Each XML file and DAT file includes the following types of information regarding the trademark application:
- Application number and registration number
- Key dates: filed, registered, etc.
- Name, type, and category of intellectual property
- Goods and services description
- Classification codes
- Action history
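As an illustration of reading these fields from one XML file, the following is a minimal Python sketch. It matches elements by local name so that no namespace URIs have to be hard-coded. The element names ApplicationNumberText and RegistrationNumber, the namespace URIs, and the sample record are assumptions made for this example only; the schema document distributed with the data is the authority on the actual element names.

```python
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip a leading '{namespace}' prefix from an ElementTree tag."""
    return tag.rsplit("}", 1)[-1]


def first_text(root, name):
    """Return the text of the first element with this local name, or None."""
    for elem in root.iter():
        if local_name(elem.tag) == name:
            return (elem.text or "").strip()
    return None


# Hypothetical, heavily simplified record. Real files are much larger, and
# the namespace URIs and element names below are placeholders.
SAMPLE = """<tmk:Trademark xmlns:tmk="urn:example:tmk" xmlns:com="urn:example:com">
  <com:ApplicationNumberText>1234567</com:ApplicationNumberText>
  <com:RegistrationNumber>TMA999999</com:RegistrationNumber>
</tmk:Trademark>"""

root = ET.fromstring(SAMPLE)
app_no = first_text(root, "ApplicationNumberText")
reg_no = first_text(root, "RegistrationNumber")
print(app_no, reg_no)
```

Matching on local names is a convenience for exploration. Code that must distinguish identically named elements in different namespaces would need the namespace mappings from the schema.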
Production schedule: weekly and annually
On a weekly basis, a collection of XML files is produced for all new and updated trademark applications and registered trademarks. The name of each weekly folder includes the extraction date and time. Each weekly collection includes a list of all trademarks deleted from the Canadian Intellectual Property Office (CIPO) records, all updated trademark records, and an index file (TXT) that lists the delete and update folders generated. For each week, several update folders are provided, organized in batches of up to approximately 260 MB. The name of each update folder indicates the range of application numbers included in the batch. Each update folder contains one XML file per trademark, listed by application number. If the trademark application also includes an image, a corresponding PNG file is included. These collections of new and updated files are provided for the current calendar year. This results in 52 weekly collections that range from 200 MB to 800 MB, depending on the volume of activity.
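The layout of an update folder described above can be processed with a short script. The following is a minimal Python sketch that pairs each XML file with its PNG image, if one exists. It assumes that an XML file and its image share the same base file name and that the extensions are lowercase .xml and .png; the file names shown are hypothetical, and the index file should be consulted for the actual naming.

```python
import tempfile
from pathlib import Path


def pair_records(update_folder):
    """Map each XML file's base name to (xml_path, png_path or None)."""
    folder = Path(update_folder)
    # Assumption: an image has the same base name as its XML record.
    pngs = {p.stem: p for p in folder.glob("*.png")}
    return {x.stem: (x, pngs.get(x.stem)) for x in sorted(folder.glob("*.xml"))}


# Build a throwaway folder with hypothetical file names to show the pairing.
with tempfile.TemporaryDirectory() as tmp:
    for name in ("1000001.xml", "1000001.png", "1000002.xml"):
        (Path(tmp) / name).touch()
    pairs = pair_records(tmp)
    with_image = sorted(k for k, (_, png) in pairs.items() if png is not None)
    without_image = sorted(k for k, (_, png) in pairs.items() if png is None)

print("with image:", with_image)
print("without image:", without_image)
```

Records without an image map to None rather than being dropped, since not every trademark application includes an image.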
In addition to XML and PNG files, CIPO also produces DAT and TIFF files for all new and updated trademark applications and registrations. These collections of updated and new files are provided for the current calendar year. This results in 52 weekly collections that range from 5 MB to 60 MB depending on volume of activity.
On an annual basis, a complete refreshed collection of trademark files (XML and DAT) is produced. This includes all trademarks from 1865 to the most recent completed calendar year. Collections are organized by trademark application number.
As of 2017, a collection of refreshed XML trademark data consisted of 98 files and was approximately 24 GB. For the XML format, an index document and a schema document are also included. The name of each refreshed batch file includes the extraction date and the trademark application number range.
A collection of refreshed trademark data in DAT and TIFF format is also available. Three collections are available: a single DAT file with all trademark data (3.7 GB), a collection of sound files associated with trademarks (27 MB), and multiple folders containing image files (range#). As of 2017, 32 folders of TIFF files are available. Most folders contain approximately 10,000 files and are approximately 2 GB each.