Teradata (page under construction, but you may still find it useful)
- http://www.teradata.com - Teradata is a database engine optimized for data warehousing (for reading, not for inserting). It is used for huge data warehouses, for example at Fortune 100 companies and major airlines. The company has over 25 years of experience and has implemented over 1,900 data warehouses worldwide.
Examples of clients: PayPal (migrated to Teradata from Oracle), Verizon Wireless (~50 million clients), Wachovia Bank, Delta Airlines (consolidated 27 data marts into one data warehouse), American Red Cross, CVS/pharmacy, United States Air Force.
Teradata's customers reportedly include:
- 90% of the top global telecommunications companies
- 50% of the top global retailers
- 70% of the top global airlines
- 60% of the top global transportation logistics companies
- 40% of the top global commercial and savings banks
Teradata was spun off from NCR in 2007 (NCR = National Cash Register: http://www.ncr.com).
http://www.info.teradata.com/ - documentation.
Basically, it is a SQL database specifically engineered for building big data warehouses. It uses parallel processing for everything and is designed to scale in all dimensions to very large sizes. It can run on Unix and Windows servers.
- It uses an architecture of independent, self-sufficient nodes. Performance grows roughly linearly with the number of nodes (processors, servers) added and, by design, does not deteriorate as the number of simultaneously working users increases.
- Parallelism across nodes / servers means that, given enough nodes, fetching a million rows can take about as long as fetching 100 rows.
- Up to 64 joins in a query.
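The idea of independent nodes working in parallel can be illustrated with a small sketch. This is not Teradata's implementation. It is a minimal Python simulation, assuming rows are assigned to nodes by a simple modulo on a numeric key; the names `distribute`, `node_scan` and `parallel_sum` are hypothetical. Each node aggregates only the rows it owns, and the partial results are merged at the end.

```python
from concurrent.futures import ThreadPoolExecutor


def distribute(rows, node_count):
    """Assign each (key, value) row to one node by key modulo node_count (an assumption)."""
    slices = [[] for _ in range(node_count)]
    for key, value in rows:
        slices[key % node_count].append((key, value))
    return slices


def node_scan(node_rows):
    """Work done by a single node: aggregate only the rows it owns."""
    return sum(value for _, value in node_rows)


def parallel_sum(rows, node_count):
    """Run node_scan on every slice concurrently and merge the partial sums."""
    slices = distribute(rows, node_count)
    with ThreadPoolExecutor(max_workers=node_count) as pool:
        partials = list(pool.map(node_scan, slices))
    return sum(partials)


rows = [(key, key * 2) for key in range(1000)]
total = parallel_sum(rows, node_count=4)
print(total)
```

Threads stand in for nodes here, so the sketch shows only the structure (split, scan locally, merge), not a real speed-up. The point is that each node's share of the work shrinks as nodes are added, while the result stays the same.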
Data types:
- char, varchar, long varchar
- integer, decimal, float, byte, long varbyte, byteint
- date, time, timestamp, time with time zone, timestamp with time zone
- interval year, interval year to month, interval month
- interval day, interval day to hour, interval day to minute, interval day to second
- interval hour, interval hour to minute, interval hour to second
- interval minute, interval minute to second, interval second
Tools and utilities:
- Basic Teradata Query facility (BTEQ) - general-purpose command-line tool for running commands (SQL, import/export) from a prompt or from batch/shell scripts.
- FastLoad - initial load into an empty table; very fast, parallel.
- FastExport - very fast export from tables or views.
- MultiLoad - uses block-level operations for fast inserts, updates, upserts, and deletes on large volumes of data in tables.
- TPump - continuously loads data into Teradata tables without locking the affected table.
- Teradata Parallel Transporter - combines FastLoad, FastExport, MultiLoad, and TPump.
- Teradata Replication Solutions
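As a rough rule of thumb, the one-line descriptions of the bulk utilities above can be turned into a small decision helper. This is only a sketch built from those descriptions, not official guidance. The function `choose_utility` and its parameters are hypothetical, and a real choice would depend on more factors than the three considered here.

```python
def choose_utility(direction, table_is_empty=False, continuous=False):
    """Pick a bulk utility name based only on the short descriptions in the list.

    direction: "export" or "load" (hypothetical parameter values).
    """
    if direction == "export":
        return "FastExport"   # very fast export from tables or views
    if direction != "load":
        raise ValueError(f"unknown direction: {direction!r}")
    if continuous:
        return "TPump"        # continuous loading without locking the table
    if table_is_empty:
        return "FastLoad"     # initial load into an empty table
    return "MultiLoad"        # bulk inserts, updates, upserts, and deletes


print(choose_utility("load", table_is_empty=True))
```

For example, a nightly batch of updates against an already populated table falls through to MultiLoad, while a trickle feed that must not lock the table maps to TPump.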
Teradata provides a number of standardized interfaces: http://www.teradata.com/t/page/44083/index/html
- ODBC Driver for Teradata
- JDBC Driver for Teradata
- OLE DB Data Provider for Teradata
- Microsoft .NET Data Provider for Teradata
- Teradata Call-Level Interface (CLI)
- TS/API, which permits applications that normally access IBM DB2 to run against Teradata.
- interfaces for Sybase Open Server, Open Client, etc.
Perl access via DBD::Teradata:
- http://search.cpan.org/src/DARNOLD/DBD-Teradata-1.50/doc/tdatdbd.html - docs and examples
- http://search.cpan.org/src/DARNOLD/DBD-Teradata-1.50/doc/tdatdbdfl.html - using FastLoad with DBD::Teradata
- http://search.cpan.org/src/DARNOLD/DBD-Teradata-1.50/doc/tdatdbdfx.html - using FastExport with DBD::Teradata
- http://search.cpan.org/src/DARNOLD/DBD-Teradata-1.50/doc/tdatdbdml.html - using MultiLoad with DBD::Teradata
use DBI;
my $dbh = DBI->connect('dbi:Teradata:hostname', 'user', 'password')
    or die $DBI::errstr;   # stop with the DBI error message if the connection fails
# ... (standard DBI syntax)