Complete profile
Last Updated: 2011-11-24
Data Integrity Institute Inc.
Company information
|
||||
| Legal Name: Data Integrity Institute Inc. | ||||
| Operating Name: Data Integrity Institute Inc. | ||||
|
Mailing Address
10 Plumrose Blvd SCARBOROUGH, Ontario M1E 5E8 |
Location Address
10 Plumrose Blvd SCARBOROUGH, Ontario M1E 5E8 |
|||
| Telephone: (416) 282-2298 | ||||
| Email: info@DataIntegrityInstitute.com | ||||
| Website URL: http://www.dataintegrityinstitute.com | ||||
Contact information
| Drago Pejic | ||
| Title: Fellow, Data Integrity Institute | ||
| Telephone: (416) 282-2298 | ||
| Email: info@dataintegrityinstitute.com | ||
Company description
|
Data Integrity Institute Inc. specializes in the research and development of advanced technologies for Corporate Metadata Repository, massive ETL, and Data Integrity management.
Data Integrity Institute Inc. offers Copula ETL Appliance, a first Application Specific ETL Appliance. Copula ETL Appliance is a complete ETL solution, including most advanced hardware (with massive parallel multi core CPUs, fastest external and internal networking, parallel disk arrays), load balancing parallel operating environment, most efficient ETL engine, and custom build according user business specification, ETL solution. Copula ETL Appliance is guaranteed to be implemented and delivered within 3 months after business specification is submitted, without any development on the user side. Copula ETL Appliance will execute specified ETL task within 3 hours. Copula ETL Appliance provides full metadata information with every running session (also useful for eventual process reengineering of ETL process) and full data overexposure protection. |
|
| Country of Ownership: | Canada |
| Year Established: | 2004 |
| Exporting: | Yes |
| Quality Certification: | ISO 14001 ISO 17025 NADCAP AS/EN 9100 AS/EN 9110 AS/EN 9120 AS 9003 AS 9006 DO-178B ISO/TS 16949 ANSI/ESD S20.20 ISO 9001 TL 9000 ISO 13485 ISO 22000 ISO/TS 29001 |
| Primary Industry (NAICS): | 541710 - Research and Development in the Physical, Engineering and Life Sciences |
| Alternate Industries (NAICS): |
518210 - Data Processing, Hosting, and Related Services 541510 - Computer Systems Design and Related Services |
| Primary Business Activity: | Services |
| Total Sales ($CDN): | $5,000,000 to $9,999,999 |
| Export Sales ($CDN): | $5,000,000 to $9,999,999 |
Product / Service / Licensing
| Product Name: |
Copula Application Specific ETL Appliance |
|
|
Copula Enterprise ETL Appliance encapsulates all needed for an efficient ETL solution, relieving an end user of designing, implementing and deploying of an uncertain ETL application:
- Most advanced hardware, based on the latest fast internal and external networking (Infiniband, Fiber Channel, Gigabit Ethernet), multi core CPUs, and large amount of RAM and storage, composed of standard available products and integrated to best serve massive ETL processes - Copula Enterprise ETL engine, that encapsulate extract, transform, and load of massive data vaults for simultaneous support for multiple sources and targets, error and reject handling, data cleansing, data lineage, structural, business and operational metadata and documentation generating, and data overexposure protection - Full ETL application, implemented according provided business requirement and specification http://www.dataintegrityinstitute.com/copula.htm Copula Enterprise ETL Appliance acts as a black box because it encapsulates all best composed and optimized hardware and software. Copula Enterprise ETL Appliance is guaranteed to be delivered within three months, after signing of the deal and the business specification is received. Copula Enterprise ETL Appliance guaranties to finish all ETL processes within three hours. Copula Enterprise ETL Appliance costs US$1,000,000.00 for first TB of data to be processed. For each next TB of data, Copula Enterprise ETL Appliance costs US$100,000.00 more, because three months of delivery and three hours of processing are still equally guaranteed. Long term maintenance and prompt support for eventual Change Requests are subjects of separate convenient contract. |
||
| Product Name: |
Data Lineage |
|
|
Data Lineage is a Data Integrity Institute Inc. metadata application that allows to travel forward (Daniel approach) or back (John approach) through corporate data and explore data origins and destinations.
http://www.dataintegrityinstitute.com/Data_Lineage.htm No single vendor's Structural or Operational metadata scanner (either from ASG Rochade or third party) can create conditions for a complete corporate Data Lineage tracing and analysis. Data Integrity Institute Inc. implements a Data Lineage metadata application by customizing vendors' Structural and Operational metadata scanners and building of dedicated scanners for Business and other metadata. Scanned metadata are standardized and enriched with business integration attributes (purpose, abbreviations, etc.) and linked according business logic. |
||
| Product Name: |
Enterprise Metadata System |
|
|
Data Integrity Institute Inc. offers the most sophisticated metadata solutions built on the top of one of the world's most powerful metadata management engines, ASG Rochade, including most important metadata applications for data management: Impact Analysis, Data Lineage, Data Standardization, Regulatory Compliance, Data Overexposure Protection, Metadata Reporting.
- Impact Analysis is a Data Integrity Institute Inc. metadata application that allows to explore impacts of particular changes in the data model (impacts of adding, changing or removing of database or business objects). - Data Lineage is a Data Integrity Institute Inc. metadata application that allows to travel forward (Daniel approach) or back (John approach) through corporate data and explore data origins and destinations. - Data Standardization is a Data Integrity Institute Inc. metadata application that allows full business oriented data integration, avoiding redundancy and bottlenecks. - Regulatory Compliance is a Data Integrity Institute Inc. metadata application that allows reasonable full compliance with Sarbanes - Oxley Act and other regulatory standards. - Data Integrity Institute Inc. offers several (Data Overexposure Protection, Islamic Banking, for example) or implements according specification Corporate Specific metadata applications. - Data Integrity Institute Inc. offers and implements according specification customized interactive and unattended predefined and ad hoc metadata Reports. http://www.dataintegrityinstitute.com/Enterprise_Metadata_System.htm |
||
| Product Name: |
ETL project management |
|
|
Data Integrity Institute Inc. provides on site competitive cost efficient ETL project management, either for new ETL projects or to save and successfully deliver an existing ETL project in jeopardy.
http://www.dataintegrityinstitute.com/ETL_project_management.htm On site independent ETL project management will make an ETL project feasible and significantly save implementation time and money. Risks and bottlenecks early identification in an ETL project is very important because even major ETL tool vendors recommend best practice which non critical following can jeopardize an ETL project. |
||
| Product Name: |
Impact Analysis |
|
|
Impact Analysis is a Data Integrity Institute Inc. metadata application that allows to explore impacts of particular changes in the data model (impacts of adding, changing or removing of database or business objects).
http://www.dataintegrityinstitute.com/Impact_Analysis.htm No single vendor's Structural or Operational metadata scanner (either from ASG Rochade or third party) can create conditions for a complete corporate Impact Analysis. Data Integrity Institute Inc. implements an Impact Analysis metadata application by customizing vendors' Structural and Operational metadata scanners and building of dedicated scanners for Business and other metadata. Scanned metadata are standardized and enriched with business integration attributes (purpose, abbreviations, etc.) and linked according business logic. |
||
| Product Name: |
On site ETL seminars |
|
|
Data Integrity Institute Inc. provides on site ETL seminars, ETL tool selection, ETL platform selection, ETL project review, ETL performances estimation, ETL obstacles and bottlenecks identification, ETL best practice revision, staff coaching and training.
http://www.dataintegrityinstitute.com/on_site_ETL_seminars.htm On site independent ETL seminars will make an ETL project feasible and significantly save implementation time and money. Risks and bottlenecks early identification in an ETL project is very important because even major ETL tool vendors recommend best practice which non critical following can jeopardize an ETL project. |
||
Technology profile
Data Integrity Institute Inc. specializes in the research and development of advanced technologies for Corporate Metadata Repository, massive ETL, and Data Integrity management.Data Integrity Institute Inc. offers Copula ETL Appliance, a first Application Specific ETL Appliance. Copula ETL Appliance is a complete ETL solution, including most advanced hardware (with massive parallel multi core CPUs, fastest external and internal networking, parallel disk arrays), load balancing parallel operating environment, most efficient ETL engine, and custom build according user business specification, ETL solution.
Copula ETL Appliance is guaranteed to be implemented and delivered within 3 months after business specification is submitted, without any development on the user side. Copula ETL Appliance will execute specified ETL task within 3 hours. Copula ETL Appliance provides full metadata information with every running session (also useful for eventual process reengineering of ETL process) and full data overexposure protection.
Market profile
Alliances:
- Financial
- Sales/Marketing
- Technology
Strategic alliances:
Data Integrity Institute Inc. specializes in the research and development of advanced technologies for Corporate Metadata Repository, massive ETL, and Data Integrity management.Data Integrity Institute Inc. offers Copula ETL Appliance, a first Application Specific ETL Appliance. Copula ETL Appliance is a complete ETL solution, including most advanced hardware (with massive parallel multi core CPUs, fastest external and internal networking, parallel disk arrays), load balancing parallel operating environment, most efficient ETL engine, and custom build according user business specification, ETL solution.
Copula ETL Appliance is guaranteed to be implemented and delivered within 3 months after business specification is submitted, without any development on the user side. Copula ETL Appliance will execute specified ETL task within 3 hours. Copula ETL Appliance provides full metadata information with every running session (also useful for eventual process reengineering of ETL process) and full data overexposure protection.
Industry sector market interests:
- Agriculture
- Construction
- Consumer Products
- Culture
- Environment
- Fishery
- Forestry
- Information Technology and Telecommunications
- Manufacturing
- Medical/Biotechnology/Chemical
- Mining/Petroleum/Gas
- Service Industry
- Tourism
- Transportation
- Wholesale/Retail
- Aerospace
- Defence
- Automotive
- Food and Beverage Manufacturing
- Furniture and Wood Product
- Pulp and Paper
- Plastics and Rubber Products
- Primary and Fabricated Metal
- Electrical Equipment
- Textile and Clothing
Geographic markets:
Export experience:
- Germany
- United States
- California
- New Jersey
- New York
- Virginia
- Wisconsin
Actively pursuing:
- Algeria
- Australia
- Austria
- Belgium
- Brazil
- Brunei Darussalam
- Chile
- China
- Denmark
- Djibouti
- Egypt
- France
- Greece
- Hong Kong
- Iceland
- Indonesia
- Iran, Islamic Republic of
- Ireland
- Israel
- Italy
- Japan
- Jordan
- Kazakhstan
- Korea, Republic of
- Kuwait
- Kyrgyzstan
- Lebanon
- Libyan Arab Jamahiriya
- Liechtenstein
- Luxembourg
- Macao
- Malaysia
- Mexico
- Micronesia, Federated States of
- Monaco
- Morocco
- Netherlands
- New Zealand
- Oman
- Pakistan
- Qatar
- Russian Federation
- San Marino
- Saudi Arabia
- Singapore
- South Africa
- Spain
- Switzerland
- Syrian Arab Republic
- Taiwan
- Tajikistan
- Tunisia
- Turkey
- Turkmenistan
- United Arab Emirates
- United Kingdom
- Uzbekistan
- Yemen
- Alabama
- Alaska
- Arizona
- Arkansas
- Colorado
- Connecticut
- Delaware
- District of Columbia
- Florida
- Georgia
- Hawaii
- Idaho
- Illinois
- Indiana
- Iowa
- Kansas
- Kentucky
- Louisiana
- Maine
- Maryland
- Massachusetts
- Michigan
- Minnesota
- Mississippi
- Missouri
- Montana
- Nebraska
- Nevada
- New Hampshire
- New Mexico
- North Carolina
- North Dakota
- Ohio
- Oklahoma
- Oregon
- Pennsylvania
- Rhode Island
- South Carolina
- South Dakota
- Tennessee
- Texas
- Utah
- Vermont
- Washington
- West Virginia
- Wyoming
Sector information
Unique applications:
Data Integrity Institute Inc. offers Copula ETL Appliance, a first Application Specific ETL Appliance. Copula ETL Appliance is a complete ETL solution, including most advanced hardware (with massive parallel multi core CPUs, fastest external and internal networking, parallel disk arrays), load balancing parallel operating environment, most efficient ETL engine, and custom build according user business specification, ETL solution.Copula ETL Appliance is guaranteed to be implemented and delivered within 3 months after business specification is submitted, without any development on the user side. Copula ETL Appliance will execute specified ETL task within 3 hours. Copula ETL Appliance provides full metadata information with every running session (also useful for eventual process reengineering of ETL process) and full data overexposure protection.
ETL solution for extremely large volume of data of 2500 million records using Ascential DataStage in an NCR Teradata and IBM DB2 environment.
Key / Major clients:
Bank of Canada, Ottawa, ONBank of America, Dallas, TX
Bruce Power, Tiverton, ON
RBC Royal Bank, Toronto, ON
Wisconsin State Department Of Transportation, Madison, WI
Philip Morris, Richmond, VA
Blockbuster, Dallas, TX
Banque Nationale, Montreal, PQ
Hoffmann - La Roche, Nutley, NJ
Enbridge, Consumers Gas, Toronto, ON
Amex, American Express, Markham, ON Canada
Amex, American Express, Brighton, ES United Kingdom
CIBC - Canadian Imperial Bank of Commerce, Toronto, ON
Gap, San Francisco, CA
Bell Canada, Toronto, ON
Bell Canada, Montreal, PQ
WCB Alberta, Edmonton, AB
Ministry of Health, Ontario
ISM - IBM Global Services, Toronto, ON
d.d. synergy GmbH, Hamburg, Germany
Glaxo Wellcome Inc., Mississauga, ON
IBM Canada Ltd., Toronto, ON
CGI, Toronto, ON
CGI, Montreal, PQ
Westbury Canadian Life, Hamilton, ON
Success stories:
Data Integrity Institute Inc. has developed and through research continues to systematically improve upon some of the most efficient sorting algorithm based on advanced nanotechnology research.Today's major database and ETL engines use comparison based sorting algorithms derived from the well known quick sort, heap sort, merge sort, etc., algorithms. All of these comparison based algorithms operate on the item level, utilizing multiple active and passive comparisons per item.
Active comparison compares a selected item with other items by calling comparison functions customized to handle a comparison for a particular item type, which returns three possible outcomes: equal (0), greater than (+1), or less than (-1). The comparison process can be very slow because of the need to compare each item to other items multiple times and especially because of the customized comparison function call overhead.
Passive comparison checks each item several times for the position of that item in the memory array and/or in the file to ensure that that item is within its expected boundaries. This is done to eventually swap compared items and for many other purposes. The position of an item is represented by an integer, but it is still time consuming to perform the checks several times for each item.
Through its nanotechnology research initiative, Data Integrity Institute Inc. has developed the most efficient sorting algorithm that does not operate on the item level, but rather on the item sub level (nano level), and never performs either active or passive comparison of items. With such fine granulation there is no overhead making the sorting process fast and efficient.
Nanotechnology Sorting Statistics ( on a single workstation )
Nanotechnology Sorting sorts an array of 1 billion ( 1 000 000 000 ) double, 64 bit, floating point numbers (IEEE Standard 754), within one second on the single 4 core CPU with 24 GB RAM memory.
Workstation: Single
Motherboard: One
CPU: One
Cores per CPU: Four
RAM: 24 GB
Operating system: Either Microsoft Windows 64 bit or Linux 64 bit
Sorted data: Random generated array of 1 billion double, 64 bit, floating point numbers (IEEE Standard 754)
FPU used: No, all notechnology, sub item level access
Data types supported: Any
SQL data types supported: All SQL data types, including all types of Binary Large Objects (BLOB)
Most efficient data to sort: Integer
Linearity distortion (slowing) when sorted double, 64 bit, floating point numbers (IEEE Standard 754), data type, compared to 64 bit integer data type: Less than 0.01 (1%)
- Description: Sorting of an array of double, 64 bit, floating point numbers, will take no more than 1.01 of time of sorting of a same size array of 64 bit integers
Linearity distortion (slowing) when size of a sorted array is doubled: Less than 0.01 (1%)
- Description: Sorting of a twice size array of same data type will take no more than 2.01 of time
Nanotechnology Sorting ( hard drive operations )
When an array to be sorted exceeds the amount of available RAM so hard drive is used, Nanotechnology Sorting accesses a hard drive in following steps:
1. Sequential read of data
- Description: Data are read sequentially in very large blocks, depends of available RAM. Next block is taken next from previous, and so on.
2. Sequential write of data
- Description: Data are written sequentially in very large blocks, depends of available RAM. Next block is placed next to previous, and so on.
3. Random read of data
- Description: Data are read randomly, rather in large blocks (still not each item individually), depends of available RAM. Next block is taken according internal Nanotechnology Sorting algorithm logic.
4. Sequential write of data
- Description: Data are written sequentially in very large blocks, depends of available RAM. Next block is placed next to previous, and so on.
Number of hard drives support: Limited by operating system
Number of files support: Limited by operating system
Size of files support: Limited by operating system
Number of nodes support: Limited by operating system
Size of RAM support: Limited by operating system
Support for multiple operating systems for a single sorting session: Yes
Number of multiple operating systems for a single sorting session: No limit
Support for different operating systems for a single sorting session: Yes
Number of different operating systems for a single sorting session: No limit
Contact Data Integrity Institute Inc.
For further information on Data Integrity Institute Inc.'s research or how Data Integrity Institute Inc. can help you to implement, save or maintain an enterprise ETL or metadata project, please send a detailed inquiry to: info@DataIntegrityInstitute.com , or call (416) 282-2298
Testimonial:
http://www.b-eye-network.com/blogs/linstedt/archives/2007/02/bidw_appliances.phphttp://www.dataintegrityinstitute.com/Enterprise_Metadata_System.htm
http://www.dataintegrityinstitute.com/Impact_Analysis.htm
http://www.dataintegrityinstitute.com/Data_Lineage.htm
http://www.b-eye-network.com/blogs/linstedt/archives/2007/05/where_o_where_i.php#more
Note: This document is presented in the language provided by the author/source. Most of the information contained in Canadian Company Capabilities (CCC) has been provided by sources external to Industry Canada. The accuracy, currency and reliability of the information contained in CCC are the sole responsibility of the registered companies and related organizations. Industry Canada assumes no responsibility in this respect. The registered businesses shall ensure the continued monitoring of the information contained in CCC and ask that it be modified when necessary.
-
Date Modified: 2013-05-19