Survey
* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project
Biomedical Computing and Standards Daine Richard Lesniak Computer Science and Software Engineering Department University of Wisconsin – Platteville Platteville, WI 53818 [email protected] Abstract Bioinformatics and medical computing hereafter referred to jointly as biomedical computing, are both areas with significant challenges, great risks, and even greater rewards. Some of the challenges encountered in biomedical computing include extremely large datasets, incompatible data vocabularies, human safety, and supporting geographically distant collaboration while maintaining security. To achieve the rewards and overcome the challenges solid software engineering approaches must be taken, and one particular aspect of software engineering that has had great success in biomedical computing is the use of standards. This paper will address the benefits of standards, existing standards, and the creation of internal standards with respect to biomedical computing. Special attention will be given to the IEEE 1073 family of standards and standards associated with the recently enacted Health Insurance Portability and Accountability Act. Standards will be evaluated in the context of four areas of biomedical computing: Process Control, Archiving, Numerical Processing, and communications, as detailed in “Bioinformatics Computing”, by Bryan Bergeron, MD. [1] Introduction BioMedical Computing Biomedical computing is the use of computer technology to assist with endeavors in the fields of biology and medicine. Biomedical computing is an area of much interest lately, with advances in healthcare and biotechnology being increasingly tied to computers. Due to this increased need for biomedical computing, many educational institutions are offering graduate degrees in computational biology and bioinformatics. The demands of industry and the emergence of academic programs have made biomedical computing an area of active research. Biomedical computing covers a broad spectrum of possibilities, including everything from pacemakers to laboratory automation to databases full of patient records. What makes biomedical computing interesting in that it is not a technology so much as a domain; a broad set of problems that must be met with whatever technology is available and appropriate. While biomedical computing covers many different types of technology being used in many different ways, biomedical computing does address four main areas: process control, archiving, numerical processing, and communications. Standards Standards are a useful tool in software engineering. Standards allow for reuse of procedures and practices that have proved helpful in the past, hopefully keeping past mistakes from being repeated while allowing success to be duplicated. Standards, like biomedical computing, take many forms. Standards can be used to dictate how various devices are to communicate with each other, the coding style to be used on a project, and various safety criteria a program must meet before it can be put into general use. HIPAA One recent reason for an increase in standards for biomedical computing is HIPAA, the health insurance portability and accountability act.[2] HIPAA covers a wide variety of subjects, but one of particular interest is the aspect of HIPAA that addresses the security of patient information. HIPAA has authority over heath plans, health clearinghouses, and healthcare providers; it also requires these organizations to enter a chain of trust with anyone they make patient information available to. This means that it is the responsibility of the covered entity to ensure anyone they provide patient information to must also be compliant with HIPAA. Failures in HIPAA compliance can be very costly due to the fact that the penalty is a lawsuit on behalf of the injured party rather than a fine.[3] HIPAA gives general guidance on issues of privacy and security, and requires covered entities to generate, document, and implement internal standards that “make sense”. Process Control Process control consists of interfacing computer systems with the real world in order to control a physical system or gather information from sensors. Process control in biomedical computing can be implantable medical devices such as pacemakers, automated laboratory equipment such as micropipette machines, and information gathering machines such as the optical scanners used in micro array experiments. When used to automate lab procedures, process control can not only speed up the initial setup of experiments, but can also reduce error inherent in having humans do repetitive tasks. Process control can also be used in instances where exposure to dangerous materials is required; reducing the amount of human handling that must take place. One common biomedical activity that incorporates a great deal of process control is a microarray experiment. In a microarray experiment a microarray is prepared with spots of known DNA samples, this initial preparation is often accomplished by using an automated micropipette machine. A reference and probe sample, both constructed to fluoress a different color, are then allowed to competitively hybridize with the known DNA samples on the microarray; the more nucleotides the microarray spot and reference or probe have that complement each other the stronger the attraction. The microarray is then fluoressed, and an optical scanner is used to gather the information. The greatest challenge faced by the process control aspect of biomedical computing is ensuring human safety. Devices such as pacemakers and medical monitors are under the regulation of the FDA, which is responsible for certifying their fitness for use and performing such tasks as recalls when a defect is found in an already released device. One such recall is the recent recall of the LIFEPAK 500, an automated defibulator. The recall was a class 1 recall, which means that there was a likelihood that use of the device would cause serious injury of death; the company was required to replace all the affected units at no cost [4]. Clearly the challenge of human safety in process control and biomedical computing should be held foremost, not only for the intrinsic value of human life but for the financial and publicity burdens a technology company can face should a device prove unsafe. Standards play a central role in Process control. As previously stated, it is the responsibility of the FDA to decide if medical devices are safe for use. The process of getting a device approved can be expedited by use of a declaration of conformity which shows that the device in question conforms to a recognized safety standard.[5] While this may not be sufficient for a device to be fully approved, it certainly helps. One set of standards currently being worked on is the IEEE 1073 family of standards.[6] The IEEE 1073 point of care medical device communication standards specify formats, speeds, and communication protocols for medical devices. The goal behind this is to make medical devices more compatible with each other, allowing for standard plug and play operation. By doing this, human safety is increased; information that would have to be transferred to a paper intermediary or committed to memory then reentered on another device can now simply be transferred at a lower risk of error. The IEEE 1073 family of standards covers such devices as vital signs monitors, defibulators, and weight scales. Archiving The archiving aspect of biomedical computing involves storing biomedical data and metadata, and ensuring that the data can be accessed in a timely manner. Archiving is usually thought of in terms of databases, but also involves backups, access control, and data formats. Many types of information are archived in biomedical computing, from patient records to gene sequences to three-dimensional representations of proteins. Biomedical databases vary quite a bit in size and scope, from proprietary databases maintained by biotech companies to house data from their own labs to large public databases such as Swiss-Prot and GenBank. Swiss-Prot and GenBank are large public databases for protein information and gene sequences respectively. One challenge faced by archiving is the extremely large datasets found in biomedical computing. GenBank alone holds over thirty million gene sequences [7], and this database would most likely be utilized with several other public and private databases in order to achieve something useful. Another challenge faced by archiving in biomedical computing is the fact that there is a plethora of conflicting data vocabularies that must be reconciled with each other. Incompatible data vocabularies occur not only due to the differences in the type of information describes, such as disease symptoms and patient history, but there are many conflicting data vocabularies for the same type of information as well. Maintaining the security of archived information is also a significant challenge. This is especially important with the enacting of HIPAA, which states that archived information which is associated with patients must be protected in terms of confidentiality, integrity, and availability. Protecting the confidentiality of archived information consists of making sure only those who should have access have access, and that patient information is not kept on insecure mediums. One example of this is that hospitals now will not leave patient information on answering machines, as anyone can come and push play to hear the message. Protecting the integrity of archived information is ensuring that the information is not modified in any way that it should not be, and that when retrieved, it is indeed the information left in the first place. Availability of information must be ensured as well, meaning that if someone needs the information, and they are indeed entitled to it, they can retrieve it in a timely manner. This is extremely important, given that information such as allergies to medications can be needed on short order and can be life critical. HIPAA plays a very large role when it comes to archiving. It requires standards to be generated, documented and implemented that address protecting archived data’s confidentiality, integrity, and availability. HIPAA recommends first assessing the risks and vulnerabilities to the information, developing and implementing security measures that fit the particular needs of the organization, and then documenting these in the form of an internal standard. There are specific things that HIPAA requires in the security standard developed, as stated in the following list: 1) 2) 3) 4) 5) 6) 7) The standards shall include access control The standards shall include some form of cryptography The standards shall include a data backup plan The standards shall include a disaster recovery plan The standards shall include an emergency operation plan The standards shall be periodically reviewed and tested The standards shall include a timetable for requirement 6 Most of the required aspects of the archiving security standard are self-explanatory; one thing that may be unfamiliar is the emergency operations plan. The emergency operation plan is how the system will function in the event of a disaster, which could be a physical problem such as a tornado going through the server building, a hacker corrupting data, or something else unexpected that disrupts normal operations. The disaster recovery plan must identify which information must be available no matter what, and needs to have a course of action that will make this information available until the full system can be restored. It should be noted that HIPAA attempts to be technology neutral and not prescribe any specific solutions. This allows internal standards to be developed to comply with HIPAA and still take advantage of improvements in technology. An internal standard that is very useful when taking advantage of numerous public and private databases is a data dictionary. A data dictionary is an internal standard that describes what format information will take, and how to translate various formats into the standard format. By utilizing a data dictionary in this way, diverse datasets can be tied together into a single data mart.[1] Numerical processing Numerical processing in biomedical computing is when computers are used to analyze and process information. One common numerical processing task is the prediction of protein structure using statistical and machine learning algorithms. This is extremely important in biotechnology, as proteins are the building blocks of organisms, and their behavior and function is determined largely by the proteins structure. Numerical processing is also used in gene sequencing. In gene sequencing geneticists first split a genome into many smaller fragments, decode these, then utilize pattern-matching algorithms to reassemble the decoded fragments into an entire decoded genome. Simulation and visualization of biological processes are also important aspects of numerical processing. These are used to perform in silica experiments, which can be used to find where more costly wet lab experiments are warranted. Data mining is also a useful numerical processing activity in biomedical computing. Data mining makes direct use of the large archived datasets by searching them for relationships that could be of interest. Numerical processing faces many of the challenges archiving does with respect to large and incompatible datasets. This is especially true for data mining, which must make use of different datasets to find relationships. The data dictionary created for archived information proves invaluable to numerical processing tasks. Whether data mining or simulation, numerical processing requires information to be processed, and that information must be understood, this understanding comes from the metadata description contained within the data dictionary. While not a formal standard, OpenGL has become the defacto standard for threedimensional graphics in biomedical computing.[1] Programs that facilitate visualization of biological data invariably use OpenGL, in part because of the inertia it has in the field. The more projects it is used on, the more expertise is invested in it, and the more likely future projects will be implemented in OpenGl. Communications In the past ten years computers have revolutionized communications, and this has impacted biomedical computing a great deal. Archiving the mass of biomedical information would be of little value without the ability to transfer the information from its database to the end user via networks. Computer communication technology not only aids biomedical researchers with email, file transfers, and web searching, but also allows for collaborative software. Collaborative software allows for two or more people to operate and share control of the one running program. This is very useful for collaboration between geographically distant people. A popular program for molecular modeling is chimera, which also has a collaborative extension to allow for remote collaboration [1]. The primary challenge faced by the communications aspect of biomedical computing is security. If information being communicated is patient information HIPAA comes into play, but even if the communications are outside the scope of HIPAA it is important to address the security of communications. Proprietary biomedical information can often be worth quite a bit of money, with 2001 pharmaceutical R&D adding up to thirty billion dollars. [1] It is also important to note that there are active groups that oppose certain aspects of biotechnology, and hacktivism is a real possibility. HIPAA requires entities that communicate patient information to generate, implement, and document standards for communications security in the same manner that they are required for archiving security. Like archiving, the communications standards are open ended to allow for incorporation of new technologies, and are intended to make sense for the covered entity. The required items in a HIPAA compliant communications are as follows: 1) The standards shall include data integrity checks 2) The standards shall include message authentication 3) The standards shall include entity authentication 4) The standards shall include access controls 5) The standards shall include some form of encryption 6) The standards shall include alarms 7) The standards shall include audit trails 8) The standards shall include event reporting 9) The standards shall be periodically reviewed and tested 10) The standards shall include a timetable for requirement 9 Alarms and event reporting are two required aspects of a HIPAA compliant communications security plan that warrant further explanation. Alarms are a combination of being able to detect when something has happened, and being able to alert the proper individuals. This could be information that was sent but never received causing an email and cell phone alert to go to both the sender, receiver, and whoever is responsible for HIPAA compliance. Event reporting is the procedure for alerting external entities about what has happened, including law enforcement authorities and the patient who’s privacy has been violated. This is an especially important aspect of HIPAA as many security lapses go unreported to authorities, ensuring those responsible are never caught. Conclusions Biomedical computing is a varied field, with many challenges. One aspect of software engineering that has had great success in meeting these challenges is standards. Some standards, HIPAA for example, are required by law. Other standards simply assist in successful program operations, such as a data dictionary and IEEE 1073. HIPAA has had a significant impact, causing both increased privacy and security for patients and an immediate need for covered entities to evaluate their current systems. References [1]Bryan Bergeron, M.D., Bioinformatics Computing, Prentice Hall, Upper Saddle River, 2003. [2] http://www.hhs.gov/ocr/hipaa/ [3] http://www.cms.hhs.gov/hipaa/hipaa2/enforcement/default.asp#penalties [4]http://www.fda.gov/cdrh/recalls/recall-020305.html [5]http://www.fda.gov/cdrh/ost/guidance/321.html [6] http://www.ieee1073.org/ [7]http://www.ncbi.nlm.nih.gov/Genbank/ [8]http://www.cgl.ucsf.edu/Research/collaboratory/project.html