Accession Number : ADA319624

Title :   Report on Work in Progress: Massive Data Analysis Systems.

Descriptive Note : Quarterly technical rept. Oct-Dec 96,

Corporate Author : SAN DIEGO SUPERCOMPUTER CENTER CA

Personal Author(s) : Frost, Richard ; Baru, Chaitanya ; Marciano, Richard ; Wan, Mike ; Moore, Reagan

PDF Url : ADA319624

Report Date : 08 JAN 1997

Pagination or Media Count : 222

Abstract : The creation of a Massive Data Analysis System (MDAS) will enable new modes of science through improved management tools for scientific data sets, computational methods, and computational resources. To provide these capabilities, MDAS researchers are developing a software infrastructure to support data location transparency, access transparency, and conversion transparency in a heterogeneous, distributed system. This requires a scalable software infrastructure that can manage petabytes of data, support rapid access to selected data sets, and provide support for subsequent computationally intensive analyses. The system must automate the collection of metadata describing data sets, computational methods, resources, and users. Some of the core technologies being used to provide this functionality include object-relational database systems, archival storage systems, parallel I/O, third-party transfers, and method-level authentication. By supporting transportable methods for manipulating the data, it then becomes possible to analyze selected data sets on remote systems. The MDAS becomes an infrastructure on which to build next-generation operating and scheduling systems that can manage the flow of data and computation across distributed resources.

Descriptors :   *DATA BASES, *COMPUTATIONS, *DATA MANAGEMENT, *TOOLS, *REPORTS, *RESOURCES, *NUMERICAL METHODS AND PROCEDURES, COMPUTER PROGRAMS, CONVERSION, POSITION(LOCATION), QUICK REACTION, INFORMATION SYSTEMS, DISTRIBUTED DATA PROCESSING, PARALLEL PROCESSING, SCHEDULING, SCALING FACTOR, TRANSPORTABLE, ACCESS, FLOW, STORAGE, ARCHIVES, REMOTE SYSTEMS, TRANSPARENCIES, INFRASTRUCTURE.

Subject Categories : Information Science
      Numerical Mathematics
      Computer Programming and Software

Distribution Statement : APPROVED FOR PUBLIC RELEASE