
|
|
Bioinformatics, Biostatistics and Computational Analysis Core Facility
Purpose
The work in the Center involves large epidemiology studies, developmental toxicology, animal studies, and gene expression studies involving microarrays. These efforts generate large databases that require the extensive use of complicated statistics. To accommodate these needs, MCTEH has a Bioinformatics, Biostatistics, and Computational Analysis Core Facility.
Services
This core facility’s functions include: Assistance with statistical study design and analysis of results; development of data collection forms and manuals of procedures; data management; and development of new statistical methodologies. In most cases, it is expected that the Principal Investigator will send a technician, graduate student or post-doctoral fellow to the Core to be trained in the statistical analysis of a particular assay. After training is completed, that individual would be expected to perform the analysis for the project with confirmation by the statistical expert who acts as a collaborator on the projects. In some instances, full analysis will be provided for the duration of the projects and the time for the core facility manager or faculty member will be charged at a nominal hourly rate. Data management and backup are provided to all investigators beginning at the time of their laboratory/office setup to ensure maximum protection of scientific effort.
Management
The present Director of the Biostatistics, Bioinformatics and Computational Analysis Core Facility is Dr. W. Douglas Thompson. Dr. Thompson has extensive experience with biostatistics and epidemiological study design. Currently, he is overseeing the training and supervision of technical staff and students in biostatistics.
Equipment
To enable the required computations and database management and manipulations, the core is equipped with substantial computing power through a variety of servers and powerful Beowulf clusters. It is expected that additional computing power will be added through start-up packages for the hires in biostatistics and bioinformatics. For data management and backup, MCTEH data is supported by a Dell PowerVault 770N network-attached storage server utilizing Windows Storage Server 2003 operating system. The PowerVault operating environment is protected using a RAID 1 while the data array is RAID 5 configured. It has storage capacity for 17.2 terabytes (TB) of information and is configured with redundant cooling units and power supplies. Procedures and equipment are also in place for backup, storage, archive, and retrieval of data using an Overland Neo Series 4000 Backup library. This backup system employs expandable, state-of-the-art backup technology and has the capacity to backup up to 31.2 TB and over 1 TB/h of performance. The Overland features LiveSwap(TM); tape drives, removable power supply and redundant robotics. This provides the Center with the capability to backup more than 124TBs of data. Backups occur daily and a duplicate set of backup tapes are rotated weekly through an off-site safety deposit box to protect the data in case of disaster.
|
|
|
|