550 likes | 706 Views
Adapting OpenClinica for International Collaborations. Andy Lin University of Michigan June 21, 2013 OpenClinica Global Conference 2013. Overview. Background Optical mark recognition OpenClinica on bootable USB drive Embedding audio files into OpenClinica CRFs
E N D
Adapting OpenClinica for International Collaborations Andy Lin University of Michigan June 21, 2013 OpenClinica Global Conference 2013
Overview • Background • Optical mark recognition • OpenClinica on bootable USB drive • Embedding audio files into OpenClinica CRFs • OpenClinica on iPad mini
Background • UMHS-PUHSC Joint Institute • Collaboration between University of Michigan Health System and Peking University Health Science Center • Started in October 2010 to support collaborative clinical and translational research • Studies conducted jointly by teams at each university
BRBI Core • Biorepository and Biomedical Informatics • Set up technological infrastructure for supporting research data collection • Establish standardized process for securing and storing biological specimens and clinical data • Facilitates information sharing and data management
Research Data Collection • Selected OpenClinica after comparison with other solutions • Using Enterprise Edition in at UM, run by MICHR • Using Community Edition in China • Enterprise Edition not available in China at the time • Majority of clinical data collection occurs in China • Data collection in China presents a variety of challenges
Challenges of Data Collection in China • Paper forms requested by study teams • Required to retain the original data on paper forms by IRB or departments • Not comfortable with computer interface for direct data entry • Limited internet access in rural areas • Language barrier / low literacy in rural areas • Potential for subjects to not be able to read and understand survey • Not all interviewers are native speakers • Variety of dialects in China – not everyone is fluent in Mandarin, especially in rural areas with older population • Data collection via tablets requested • Superior battery life to laptops • Generate additional interest in study participation
Overview • Background • Optical mark recognition • OpenClinica on bootable USB drive • Embedding audio files into OpenClinica CRFs • OpenClinica on iPad mini
Optical Mark Recognition • What is OMR? • Recognizes check box and multiple choice bubbles • Bubble forms / Scantrons • Standardized tests, election ballots • OMR software allows desktop computers to process data • History of usage in clinical research • Proposed for use with health care data in the 1970s • Popular in 1990s and early 2000s for transcribing to database • Recent shift towards EDC and tablets for data entry
Optical Mark Recognition • Current implementation • Paper forms still used by many studies • Transcribe to EDC or other study management software • Scanners are faster and cheaper • OMR software • Remark Office OMR • Allows users to design their own forms • Works with all common image types • Exports data in many formats – Excel, SPSS
Optical Mark Recognition • For studies using paper data collection forms • Requirements of some studies • Paper forms can be used without internet access or power • Multiple surveys can be administered at a time with self-reporting forms • Increases efficiency and accuracy of data transfer from paper forms to OpenClinica • Cost-effective solution if reused for multiple studies
Process Overview • Design forms using Remark software • Create template for reading scanned images • Collect data and scan completed forms • Extract data using Remark image reader • Use Python script to convert data into OpenClinica XML import format • Import data into OpenClinica
Scan completed form as an image Capture data using paper data collection forms Import data into OpenClinica Use OMR software to read data from the scanned image
Form Design • Can use any word processor or desktop publishing software • Use bubbles for multiple choice questions • Special font provided by Remark • Can also set up areas for text recognition • Remark does not have handwriting recognition • Requires plenty of whitespace between bubbles and text recognition areas
Template Design • Print and scan blank form for template base • Insures layout will match scanned completed forms • Open image in Remark template editor • Highlight areas with bubbles and text entry • Assign labels and values to bubbles • Allows grouping of questions with same responses (i.e. Likert scale) • Create OpenClinica CRF first, and match the region names to the OpenClinica item names
Template Design Blue region: Text Green region: OMR Specify number of bubbles and respective values in properties
Data Collection and Scanning • Forms should be completed by filling in bubbles • Check and ‘X’ marks can be too light or extend into other bubbles • Dark pen gives optimal results • B&W setting works well if forms are filled out correctly • Grayscale can pickup lighter marks or text • Minimum of 150 dpi recommended for OMR
Image Reading Process • Displays form region during reading process • Detects missing or extraneous marks • Pauses reading for correction • Detects written text in text regions • Pauses reading so user can enter text • Setting to identify and ignore large ‘X’ marks • Allows users indicate incorrect marks
Conversion to XML Format • Export data from Remark in tab-delimited text file • Add subjects to OpenClinica • Allows script to query subject IDs • Get study and form IDs from OpenClinica • Python script reads data from text file and converts into OpenClinica import format
Conversion to XML Format Tab-delimited format with header XML Format
Importing Data • XML file configured for OpenClinica’s Import Data feature • Performed by interviewer if tracking interviewer is important • Subjects must be created in advance with our method • Could use web services to automate the subject creation process
OMR in Practice • Used for 2012 summer project for UM medical students • Survey on knowledge of Hepatitis C and motivation for clinical research participation • OMR forms used at UM and at rural area near Beijing • Subjects at UM completed forms on their own • Multiple subjects taking survey at a time • Clear instructions are essential • Forms filled out by interviewers in China • Accuracy was much higher than UM forms
Efficiency and Accuracy • Used OMR process for JI-supported medical student project • 64 forms entered with entry from paper form, 111 forms entered with OMR process • 7 minutes per form for data entry • 2 minutes per form for entire OMR process • Scanning, reading, and importing • Scanned 90 sheets at a time • Efficiency of OMR improves as more data is proccesed at a time • OMR error rate of 2.2% compares favorably with data entry error rates from published studies of 2-10%**
Overview • Background • Optical mark recognition • OpenClinica on bootable USB drive • Embedding audio files into OpenClinica CRFs • OpenClinica on iPad mini
Bootable USB Drive • Developed by Manhong Dai • Bootable USB drives running OpenClinica on Linux OS • Can run OpenClinica on laptops without internet access • Cost-effective if interviewers already have their own laptops • No need to set up software on interviewer’s laptops • Single configuration for multiple computers • USB drive can be transferred between computers • Interviewers could share a single USB drive if not conducting surveys at the same time
Features • Complete setup of Linux, Apache, Tomcat, Postgres, and OpenClinica on 8GB USB drive • Persistent • Allows changes to be saved between sessions • Entire setup can be backed up and/or copied to new USB drives • Automated backup of OpenClinica database • Direct connection to server at U-M to provide IT support
Operating System • Unetbootin • Software to create bootable USB drives for Linux • Supports many distributions out of the box • Can supply ISO file to use other distributions • Lubuntu • Lightweight version Linux distribution based on Ubuntu • Targeted at users who don’t use command line • Very low hardware requirements • Up-to-date drivers supporting most laptop hardware
Operating System • Casper • Creates “Live” USB • Run preinstalled operating system from read-only media • Configured automatically by Unetbootin • 4GB allocated to persistent storage space • Need enough disk space for applications and OpenClinica database • Overlays the file system and copies all changes • Image file can be saved as regular file on Windows or Mac
Drivers and Software • Some laptop hardware may require additional drivers • Older version of Postgres required by OpenClinica • OpenClinica requires 8.4, default Lubuntu version is 9.2 • Default browser Chromium causes Javascript errors in OpenClinica • Firefox available for Linux
Backup and IT Support • USB drives have a direct connection with U-M server • Drives identified with a configuration file • Files backed up automatically every 30 minutes with shell script • Requires internet access • Script can also be run manually • Only creates new backup if changes are detected • Allows access to the Linux instance on the USB drive • Access to OpenClinica on USB if needed
USB Drives in Practice • Set up for 2013 summer project for UM medical students • Survey on motivations for participating in clinical research • Students will stay in China for 2 months • Two study sites: PKU clinic and rural clinic • No internet access at rural clinic • Small budget for project; needs to cover flight and living expenses • Using student’s personal laptops
Wireless Local Network Laptop booting from USB drive operates as a local OpenClinica server Laptop server connects to router wirelessly or via Ethernet Mobile devices connect to the laptop server through a wireless network
Overview • Background • Optical mark recognition • OpenClinica on bootable USB drive • Embedding audio files into OpenClinica CRFs • OpenClinica on iPad mini
Embedding Audio Files • Requested for surveys collecting data in rural China • Concerns over literacy of subjects and language barrier • Ensures consistent administration of survey • HTML entered in “RIGHT_ITEM_TEXT” column of OpenClinica CRF Excel spreadsheet • HTML5 <audio> tag • <embed> tag included for older browsers
Embedding Audio Files <audio controls preload> <source src="/OpenClinica/audio/section1/1.1.mp3" type="audio/mpeg"> <source src="/OpenClinica/audio/section1/1.1.ogg" type="audio/ogg"> <embed autostart="false” src="/OpenClinica/audio/section1/1.1.wav"> </audio> Audio controls in Firefox
HTML5 <audio> tag • Standard for playing audio files • Supported in all popular modern browsers • Different browsers require different formats • IE only supports MP3 files • Audio player built into browsers • User does not need to install plugins
Implementation • IE only supports MP3 format, but Firefox does not support MP3 in all versions • Need both MP3 and Wav/Ogg formats • Audio files stored in directory in top OpenClinica directory • Using <audio> tag avoids need for additional browser plugins
Audio Files in Practice • Embedded in CRFs for 2013 summer project for UM medical students • Rural site has older population with low education and literacy levels • Higher chance for older people to not speak Mandarin fluently • Difficulty of reading survey increased by medical terms • Students are not native Chinese speakers • Subjects that need to have questions read to them will hear the same file
Overview • Background • Optical mark recognition • OpenClinica on bootable USB drive • Embedding audio files into OpenClinica CRFs • OpenClinica on iPad mini
OpenClinica on iPad mini • Study team request to support data collection on iPad mini • Want to keep OpenClinica as standard software for data collection • Chrome instead of Safari • Also need to update Javascript libraries for full functionality • Limit number of questions per section in CRFs to reduce scrolling • Modifications to interface required • Input elements and buttons are too small to select • Constantly zooming in and out undesirable • Images needed to be resized for some interface buttons
Modifying OpenClinica with CSS • CRFs can be modified using inline styles • Increasing size of text and input elements for CRF not enough • CSS (Cascading Style Sheets) allow us to: • increase spacing between elements • increase size of elements not in CRF • modify other forms (Add New Subject, Schedule Study Event) • hide input elements not used by studies • Insert separate style sheet instead of editing current file • Avoid changes to code; makes upgrading difficult
Modifying OpenClinica with CSS • Include new style sheet • Required to add one line to code for pages we want to modify to link to our style sheet • Use media query to only change appearance when resolution is below a specific value • Resolution of iPad mini is 1024x768 • <link rel="stylesheet" href="includes/ipad.css" type="text/css" media="screen and (max-width: 1024px) "> • Desktop version is unaffected unless they decrease width of browser
Modifying OpenClinica with CSS • Target elements with CSS selectors to avoid making global changes • CSS attribute selector to specify form names form[action='AddNewSubject'] td, form[action='CreateNewStudyEvent'] td { padding: 3px; } • ID selector #login form, #login form input { font-size: 16px; }
Modifying OpenClinica with CSS • Hide elements on Add New Subject form form[action='AddNewSubject'] input[name='submitEnroll'], form[action='AddNewSubject'] input[name='submitDone'] { display: none; } • Resizing buttons • Buttons use a fixed image that we can’t resize • Remove image and recreate button using CSS #login form .loginbutton { background-image: none; background-color: #789ec5; padding: 0.2em 0.5em; border-radius: 0.3em; }
OpenClinica on iPad mini • Planning to use for study on Fatty Liver Disease in Pinggu • Most data collection will take place in very high-volume clinics • Interviewers will have to move around and cannot keep a laptop charged • Tablets have superior battery life • Study team is also expecting that using tablets for data collection will generate extra interest • Uncertain about reliability of internet access – may try to use laptop as portable server