Hans Morsch1, Evarist Msaki



Yüklə 5,17 Kb.
tarix01.04.2017
ölçüsü5,17 Kb.
#13131
Data migration of consistent Sickle Cell Disease clinical-database at Muhimbili National Hospital in Tanzania.

Raphael Z Sangeda1, Hans Morsch1, Evarist Msaki1, Frederick Mbuya1, Sharon Cox1, Amel Ghouila2, Bruno Mmbando1, Alia Benkahla2, Charles Newton1,3, Kais Ghedira2, Julie Makani1.

1 Muhimbili University of Health and Allied Science, Dar es Salaam,Tanzania; 2 L'Institut Pasteur de Tunis (IPT),Tunisia; 3 KEMRI-Wellcome Trust Research Programme CGMRC,Kilifi Kenya)

Background

Proper usage of biomedical data can be useful in designing future research strategies and in patient management and care. Exploring the clinical data with text mining allow the prediction of healthcare costs, disease diagnosis and prognosis, and the discovery related healthcare patterns from other databases establishing, relationships among diseases, and relationships among drugs. Moreover; clinical research on the data can allow discovering disease intervention strategies.

However, exploration of clinical data can be hindered if the database is not properly designed, un-normalized or contain redundant data. We hereby describe the automated migration of a clinical database containing cohort data of Sickle cell disease (SCD) patients attending clinic at Muhimbili National Hospital, in Dar es Salaam, Tanzania. The main objective was to migrate in the old SCD database into a new consistent database.

Methodology

The new database was modeled using MySQL-Workbench by consulting the study case report forms (CRF) and the old database.

Mapping of new database columns was matched with the new database columns to get a definitive column for migration. Values in the old database were checked against valid range of values and text values were recorded to uniform values for all clinical variables. Consistency check within and between tables was done and data cleaned. The definitive column mappings were made and these were used to guide the automated migration from old to new database.

Consistency, range check and mapping were done using python scripts or VBA scripts. The input to the script was made via comma separated values (csv) files.



Results

In total 1768 columns in 33 tables of the old database were considered important to migrate into 622 columns within 27 new tables in the new database. Nine pairs out of the 33 tables were double entry tables, which were merged in the new database design. The resulting mapping comprised of 1712 column pairs where related old columns were migrated to the same new column.



Conclusions

The newly created repository will facilitate analysing the clinical data and generate new interventions for SCD
Yüklə 5,17 Kb.

Dostları ilə paylaş:




Verilənlər bazası müəlliflik hüququ ilə müdafiə olunur ©azkurs.org 2024
rəhbərliyinə müraciət

gir | qeydiyyatdan keç
    Ana səhifə


yükləyin