Complete profile


Third-Party Information Liability Disclaimer

Some of the information on this Web page has been provided by external sources. The Government of Canada is not responsible for the accuracy, reliability or currency of the information supplied by external sources. Users wishing to rely upon this information should consult directly with the source of the information. Content provided by external sources is not subject to official languages, privacy and accessibility requirements.




Last Updated: 2011-11-24

Data Integrity Institute Inc.

Company information

Logo  
 
Legal Name:   Data Integrity Institute Inc.
Operating Name:   Data Integrity Institute Inc.
 
Mailing Address
10 Plumrose Blvd
SCARBOROUGH, Ontario
M1E 5E8
Location Address
10 Plumrose Blvd
SCARBOROUGH, Ontario
M1E 5E8
 
Telephone: (416) 282-2298
Email: info@DataIntegrityInstitute.com
Website URL: http://www.dataintegrityinstitute.com 
 
Top

Contact information

 
Drago Pejic
  Title:   Fellow, Data Integrity Institute
  Telephone:   (416) 282-2298
  Email:   info@dataintegrityinstitute.com
 

Top

Company description

 
Data Integrity Institute Inc. specializes in the research and development of advanced technologies for Corporate Metadata Repository, massive ETL, and Data Integrity management.

Data Integrity Institute Inc. offers Copula ETL Appliance, a first Application Specific ETL Appliance. Copula ETL Appliance is a complete ETL solution, including most advanced hardware (with massive parallel multi core CPUs, fastest external and internal networking, parallel disk arrays), load balancing parallel operating environment, most efficient ETL engine, and custom build according user business specification, ETL solution.

Copula ETL Appliance is guaranteed to be implemented and delivered within 3 months after business specification is submitted, without any development on the user side. Copula ETL Appliance will execute specified ETL task within 3 hours. Copula ETL Appliance provides full metadata information with every running session (also useful for eventual process reengineering of ETL process) and full data overexposure protection.
 
Country of Ownership: Canada  
Year Established: 2004
Exporting: Yes  
Quality Certification: ISO 14001 ISO 17025 NADCAP AS/EN 9100 AS/EN 9110 AS/EN 9120 AS 9003 AS 9006 DO-178B ISO/TS 16949 ANSI/ESD S20.20 ISO 9001 TL 9000 ISO 13485 ISO 22000 ISO/TS 29001
Primary Industry (NAICS): 541710 - Research and Development in the Physical, Engineering and Life Sciences
Alternate Industries (NAICS): 518210 - Data Processing, Hosting, and Related Services
541510 - Computer Systems Design and Related Services
Primary Business Activity: Services  
Total Sales ($CDN): $5,000,000 to $9,999,999 
Export Sales ($CDN): $5,000,000 to $9,999,999 

Top

Product / Service / Licensing

 
Product Name: Copula Application Specific ETL Appliance
 
Copula Enterprise ETL Appliance encapsulates all needed for an efficient ETL solution, relieving an end user of designing, implementing and deploying of an uncertain ETL application:

- Most advanced hardware, based on the latest fast internal and external networking (Infiniband, Fiber Channel, Gigabit Ethernet), multi core CPUs, and large amount of RAM and storage, composed of standard available products and integrated to best serve massive ETL processes

- Copula Enterprise ETL engine, that encapsulate extract, transform, and load of massive data vaults for simultaneous support for multiple sources and targets, error and reject handling, data cleansing, data lineage, structural, business and operational metadata and documentation generating, and data overexposure protection

- Full ETL application, implemented according provided business requirement and specification

http://www.dataintegrityinstitute.com/copula.htm

Copula Enterprise ETL Appliance acts as a black box because it encapsulates all best composed and optimized hardware and software.

Copula Enterprise ETL Appliance is guaranteed to be delivered within three months, after signing of the deal and the business specification is received.

Copula Enterprise ETL Appliance guaranties to finish all ETL processes within three hours.

Copula Enterprise ETL Appliance costs US$1,000,000.00 for first TB of data to be processed. For each next TB of data, Copula Enterprise ETL Appliance costs US$100,000.00 more, because three months of delivery and three hours of processing are still equally guaranteed.

Long term maintenance and prompt support for eventual Change Requests are subjects of separate convenient contract.

 
Product Name: Data Lineage
 
Data Lineage is a Data Integrity Institute Inc. metadata application that allows to travel forward (Daniel approach) or back (John approach) through corporate data and explore data origins and destinations.

http://www.dataintegrityinstitute.com/Data_Lineage.htm

No single vendor's Structural or Operational metadata scanner (either from ASG Rochade or third party) can create conditions for a complete corporate Data Lineage tracing and analysis.

Data Integrity Institute Inc. implements a Data Lineage metadata application by customizing vendors' Structural and Operational metadata scanners and building of dedicated scanners for Business and other metadata. Scanned metadata are standardized and enriched with business integration attributes (purpose, abbreviations, etc.) and linked according business logic.

 
Product Name: Enterprise Metadata System
 
Data Integrity Institute Inc. offers the most sophisticated metadata solutions built on the top of one of the world's most powerful metadata management engines, ASG Rochade, including most important metadata applications for data management: Impact Analysis, Data Lineage, Data Standardization, Regulatory Compliance, Data Overexposure Protection, Metadata Reporting.

- Impact Analysis is a Data Integrity Institute Inc. metadata application that allows to explore impacts of particular changes in the data model (impacts of adding, changing or removing of database or business objects).

- Data Lineage is a Data Integrity Institute Inc. metadata application that allows to travel forward (Daniel approach) or back (John approach) through corporate data and explore data origins and destinations.

- Data Standardization is a Data Integrity Institute Inc. metadata application that allows full business oriented data integration, avoiding redundancy and bottlenecks.

- Regulatory Compliance is a Data Integrity Institute Inc. metadata application that allows reasonable full compliance with Sarbanes - Oxley Act and other regulatory standards.

- Data Integrity Institute Inc. offers several (Data Overexposure Protection, Islamic Banking, for example) or implements according specification Corporate Specific metadata applications.

- Data Integrity Institute Inc. offers and implements according specification customized interactive and unattended predefined and ad hoc metadata Reports.

http://www.dataintegrityinstitute.com/Enterprise_Metadata_System.htm


 
Product Name: ETL project management
 
Data Integrity Institute Inc. provides on site competitive cost efficient ETL project management, either for new ETL projects or to save and successfully deliver an existing ETL project in jeopardy.

http://www.dataintegrityinstitute.com/ETL_project_management.htm

On site independent ETL project management will make an ETL project feasible and significantly save implementation time and money. Risks and bottlenecks early identification in an ETL project is very important because even major ETL tool vendors recommend best practice which non critical following can jeopardize an ETL project.

 
Product Name: Impact Analysis
 
Impact Analysis is a Data Integrity Institute Inc. metadata application that allows to explore impacts of particular changes in the data model (impacts of adding, changing or removing of database or business objects).

http://www.dataintegrityinstitute.com/Impact_Analysis.htm

No single vendor's Structural or Operational metadata scanner (either from ASG Rochade or third party) can create conditions for a complete corporate Impact Analysis.

Data Integrity Institute Inc. implements an Impact Analysis metadata application by customizing vendors' Structural and Operational metadata scanners and building of dedicated scanners for Business and other metadata. Scanned metadata are standardized and enriched with business integration attributes (purpose, abbreviations, etc.) and linked according business logic.

 
Product Name: On site ETL seminars
 
Data Integrity Institute Inc. provides on site ETL seminars, ETL tool selection, ETL platform selection, ETL project review, ETL performances estimation, ETL obstacles and bottlenecks identification, ETL best practice revision, staff coaching and training.

http://www.dataintegrityinstitute.com/on_site_ETL_seminars.htm

On site independent ETL seminars will make an ETL project feasible and significantly save implementation time and money. Risks and bottlenecks early identification in an ETL project is very important because even major ETL tool vendors recommend best practice which non critical following can jeopardize an ETL project.

 

Top

Technology profile

Data Integrity Institute Inc. specializes in the research and development of advanced technologies for Corporate Metadata Repository, massive ETL, and Data Integrity management.

Data Integrity Institute Inc. offers Copula ETL Appliance, a first Application Specific ETL Appliance. Copula ETL Appliance is a complete ETL solution, including most advanced hardware (with massive parallel multi core CPUs, fastest external and internal networking, parallel disk arrays), load balancing parallel operating environment, most efficient ETL engine, and custom build according user business specification, ETL solution.

Copula ETL Appliance is guaranteed to be implemented and delivered within 3 months after business specification is submitted, without any development on the user side. Copula ETL Appliance will execute specified ETL task within 3 hours. Copula ETL Appliance provides full metadata information with every running session (also useful for eventual process reengineering of ETL process) and full data overexposure protection.
Top

Market profile

Alliances:

  • Financial
  • Sales/Marketing
  • Technology

Strategic alliances:

Data Integrity Institute Inc. specializes in the research and development of advanced technologies for Corporate Metadata Repository, massive ETL, and Data Integrity management.

Data Integrity Institute Inc. offers Copula ETL Appliance, a first Application Specific ETL Appliance. Copula ETL Appliance is a complete ETL solution, including most advanced hardware (with massive parallel multi core CPUs, fastest external and internal networking, parallel disk arrays), load balancing parallel operating environment, most efficient ETL engine, and custom build according user business specification, ETL solution.

Copula ETL Appliance is guaranteed to be implemented and delivered within 3 months after business specification is submitted, without any development on the user side. Copula ETL Appliance will execute specified ETL task within 3 hours. Copula ETL Appliance provides full metadata information with every running session (also useful for eventual process reengineering of ETL process) and full data overexposure protection.

Industry sector market interests:

  • Agriculture
  • Construction
  • Consumer Products
  • Culture
  • Environment
  • Fishery
  • Forestry
  • Information Technology and Telecommunications
  • Manufacturing
  • Medical/Biotechnology/Chemical
  • Mining/Petroleum/Gas
  • Service Industry
  • Tourism
  • Transportation
  • Wholesale/Retail
  • Aerospace
  • Defence
  • Automotive
  • Food and Beverage Manufacturing
  • Furniture and Wood Product
  • Pulp and Paper
  • Plastics and Rubber Products
  • Primary and Fabricated Metal
  • Electrical Equipment
  • Textile and Clothing

Geographic markets:

Export experience:
  • Germany
  • United States
  • California
  • New Jersey
  • New York
  • Virginia
  • Wisconsin
Actively pursuing:
  • Algeria
  • Australia
  • Austria
  • Belgium
  • Brazil
  • Brunei Darussalam
  • Chile
  • China
  • Denmark
  • Djibouti
  • Egypt
  • France
  • Greece
  • Hong Kong
  • Iceland
  • Indonesia
  • Iran, Islamic Republic of
  • Ireland
  • Israel
  • Italy
  • Japan
  • Jordan
  • Kazakhstan
  • Korea, Republic of
  • Kuwait
  • Kyrgyzstan
  • Lebanon
  • Libyan Arab Jamahiriya
  • Liechtenstein
  • Luxembourg
  • Macao
  • Malaysia
  • Mexico
  • Micronesia, Federated States of
  • Monaco
  • Morocco
  • Netherlands
  • New Zealand
  • Oman
  • Pakistan
  • Qatar
  • Russian Federation
  • San Marino
  • Saudi Arabia
  • Singapore
  • South Africa
  • Spain
  • Switzerland
  • Syrian Arab Republic
  • Taiwan
  • Tajikistan
  • Tunisia
  • Turkey
  • Turkmenistan
  • United Arab Emirates
  • United Kingdom
  • Uzbekistan
  • Yemen
  • Alabama
  • Alaska
  • Arizona
  • Arkansas
  • Colorado
  • Connecticut
  • Delaware
  • District of Columbia
  • Florida
  • Georgia
  • Hawaii
  • Idaho
  • Illinois
  • Indiana
  • Iowa
  • Kansas
  • Kentucky
  • Louisiana
  • Maine
  • Maryland
  • Massachusetts
  • Michigan
  • Minnesota
  • Mississippi
  • Missouri
  • Montana
  • Nebraska
  • Nevada
  • New Hampshire
  • New Mexico
  • North Carolina
  • North Dakota
  • Ohio
  • Oklahoma
  • Oregon
  • Pennsylvania
  • Rhode Island
  • South Carolina
  • South Dakota
  • Tennessee
  • Texas
  • Utah
  • Vermont
  • Washington
  • West Virginia
  • Wyoming

Top

Sector information

Unique applications:

Data Integrity Institute Inc. offers Copula ETL Appliance, a first Application Specific ETL Appliance. Copula ETL Appliance is a complete ETL solution, including most advanced hardware (with massive parallel multi core CPUs, fastest external and internal networking, parallel disk arrays), load balancing parallel operating environment, most efficient ETL engine, and custom build according user business specification, ETL solution.
Copula ETL Appliance is guaranteed to be implemented and delivered within 3 months after business specification is submitted, without any development on the user side. Copula ETL Appliance will execute specified ETL task within 3 hours. Copula ETL Appliance provides full metadata information with every running session (also useful for eventual process reengineering of ETL process) and full data overexposure protection.

ETL solution for extremely large volume of data of 2500 million records using Ascential DataStage in an NCR Teradata and IBM DB2 environment.

Key / Major clients:

Bank of Canada, Ottawa, ON
Bank of America, Dallas, TX
Bruce Power, Tiverton, ON
RBC Royal Bank, Toronto, ON
Wisconsin State Department Of Transportation, Madison, WI
Philip Morris, Richmond, VA
Blockbuster, Dallas, TX
Banque Nationale, Montreal, PQ
Hoffmann - La Roche, Nutley, NJ
Enbridge, Consumers Gas, Toronto, ON
Amex, American Express, Markham, ON Canada
Amex, American Express, Brighton, ES United Kingdom
CIBC - Canadian Imperial Bank of Commerce, Toronto, ON
Gap, San Francisco, CA
Bell Canada, Toronto, ON
Bell Canada, Montreal, PQ
WCB Alberta, Edmonton, AB
Ministry of Health, Ontario
ISM - IBM Global Services, Toronto, ON
d.d. synergy GmbH, Hamburg, Germany
Glaxo Wellcome Inc., Mississauga, ON
IBM Canada Ltd., Toronto, ON
CGI, Toronto, ON
CGI, Montreal, PQ
Westbury Canadian Life, Hamilton, ON

Success stories:

Data Integrity Institute Inc. has developed and through research continues to systematically improve upon some of the most efficient sorting algorithm based on advanced nanotechnology research.

Today's major database and ETL engines use comparison based sorting algorithms derived from the well known quick sort, heap sort, merge sort, etc., algorithms. All of these comparison based algorithms operate on the item level, utilizing multiple active and passive comparisons per item.

Active comparison compares a selected item with other items by calling comparison functions customized to handle a comparison for a particular item type, which returns three possible outcomes: equal (0), greater than (+1), or less than (-1). The comparison process can be very slow because of the need to compare each item to other items multiple times and especially because of the customized comparison function call overhead.

Passive comparison checks each item several times for the position of that item in the memory array and/or in the file to ensure that that item is within its expected boundaries. This is done to eventually swap compared items and for many other purposes. The position of an item is represented by an integer, but it is still time consuming to perform the checks several times for each item.

Through its nanotechnology research initiative, Data Integrity Institute Inc. has developed the most efficient sorting algorithm that does not operate on the item level, but rather on the item sub level (nano level), and never performs either active or passive comparison of items. With such fine granulation there is no overhead making the sorting process fast and efficient.

Nanotechnology Sorting Statistics ( on a single workstation )

Nanotechnology Sorting sorts an array of 1 billion ( 1 000 000 000 ) double, 64 bit, floating point numbers (IEEE Standard 754), within one second on the single 4 core CPU with 24 GB RAM memory.

Workstation: Single

Motherboard: One

CPU: One

Cores per CPU: Four

RAM: 24 GB

Operating system: Either Microsoft Windows 64 bit or Linux 64 bit

Sorted data: Random generated array of 1 billion double, 64 bit, floating point numbers (IEEE Standard 754)

FPU used: No, all notechnology, sub item level access

Data types supported: Any

SQL data types supported: All SQL data types, including all types of Binary Large Objects (BLOB)

Most efficient data to sort: Integer

Linearity distortion (slowing) when sorted double, 64 bit, floating point numbers (IEEE Standard 754), data type, compared to 64 bit integer data type: Less than 0.01 (1%)

- Description: Sorting of an array of double, 64 bit, floating point numbers, will take no more than 1.01 of time of sorting of a same size array of 64 bit integers

Linearity distortion (slowing) when size of a sorted array is doubled: Less than 0.01 (1%)

- Description: Sorting of a twice size array of same data type will take no more than 2.01 of time

Nanotechnology Sorting ( hard drive operations )

When an array to be sorted exceeds the amount of available RAM so hard drive is used, Nanotechnology Sorting accesses a hard drive in following steps:

1. Sequential read of data

- Description: Data are read sequentially in very large blocks, depends of available RAM. Next block is taken next from previous, and so on.

2. Sequential write of data

- Description: Data are written sequentially in very large blocks, depends of available RAM. Next block is placed next to previous, and so on.

3. Random read of data

- Description: Data are read randomly, rather in large blocks (still not each item individually), depends of available RAM. Next block is taken according internal Nanotechnology Sorting algorithm logic.

4. Sequential write of data

- Description: Data are written sequentially in very large blocks, depends of available RAM. Next block is placed next to previous, and so on.

Number of hard drives support: Limited by operating system

Number of files support: Limited by operating system

Size of files support: Limited by operating system

Number of nodes support: Limited by operating system

Size of RAM support: Limited by operating system

Support for multiple operating systems for a single sorting session: Yes

Number of multiple operating systems for a single sorting session: No limit

Support for different operating systems for a single sorting session: Yes

Number of different operating systems for a single sorting session: No limit

Contact Data Integrity Institute Inc.

For further information on Data Integrity Institute Inc.'s research or how Data Integrity Institute Inc. can help you to implement, save or maintain an enterprise ETL or metadata project, please send a detailed inquiry to: info@DataIntegrityInstitute.com , or call (416) 282-2298

Testimonial:

http://www.b-eye-network.com/blogs/linstedt/archives/2007/02/bidw_appliances.php


http://www.dataintegrityinstitute.com/Enterprise_Metadata_System.htm
http://www.dataintegrityinstitute.com/Impact_Analysis.htm
http://www.dataintegrityinstitute.com/Data_Lineage.htm
http://www.b-eye-network.com/blogs/linstedt/archives/2007/05/where_o_where_i.php#more

Top



Note: This document is presented in the language provided by the author/source. Most of the information contained in Canadian Company Capabilities (CCC) has been provided by sources external to Industry Canada. The accuracy, currency and reliability of the information contained in CCC are the sole responsibility of the registered companies and related organizations. Industry Canada assumes no responsibility in this respect. The registered businesses shall ensure the continued monitoring of the information contained in CCC and ask that it be modified when necessary.