Accession Number : ADA316883

Title :   Quarterly Technical Report. Massive Data Analysis Systems.

Descriptive Note : Quarterly rept. Jun-Aug 96,

Corporate Author : SAN DIEGO SUPERCOMPUTER CENTER CA

Personal Author(s) : Frost, Richard ; Wan, Mike ; Baru, Chaitanya ; Moore, Reagan ; Marciano, Richard

PDF Url : ADA316883

Report Date : 07 OCT 1995

Pagination or Media Count : 50

Abstract : The creation of a Massive Data Analysis System (MDAS) will enable new modes of science through improved data management of scientific data sets. This requires a scalable software infrastructure that can manage petabytes of data, support rapid access of selected data sets, and provide support for subsequent computationally intensive analyses. To accomplish this, object-relational database technology is being integrated with archival storage systems. By supporting transportable methods for manipulating the data, it then becomes possible to analyze selected data sets on remote systems. The MDAS becomes a scheduling system, managing the flow of data and computation across distributed resources. Usage models are needed that simplify the identification, transport and analysis of large collections of data. The system must automate the collection of metadata describing the data set attributes, and handle interactive WEB access, distributed database access, and discipline specific application interfaces. A software infrastructure has been designed which manages user access restrictions, matches application requirements with resource availability, and schedules the data movement and application execution. Development of this software system is proceeding on schedule, with selected applications testing the initial prototypes.

Descriptors :   *COMPUTER PROGRAMS, *DATA BASES, *DATA MANAGEMENT, *INFRASTRUCTURE, TEST AND EVALUATION, DATA PROCESSING, REQUIREMENTS, GLOBAL, COMPUTATIONS, INFORMATION SYSTEMS, INTERACTIONS, INTERFACES, PARALLEL PROCESSING, AVAILABILITY, SCHEDULING, SCALING FACTOR, TRANSPORTABLE, USER NEEDS, ACCESS, RESOURCES, STORAGE, ARCHIVES, COLLECTION, REMOTE SYSTEMS.

Subject Categories : Computer Programming and Software
      Computer Systems

Distribution Statement : APPROVED FOR PUBLIC RELEASE