Download GDP - Association for Pathology Informatics

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
An Update on the Informatics Supported
Annotation and Integration of Datasets for
the Gynecologic Disease Program (GDP)
Waqas Amin MD1, Sambit K. Mohanty MD1&7, Anil V. Parwani MD PhD1&3, Sharon
B. Winters MS1, Nancy B. Whelan BS1, Althea M. Schneider BS1, John T. Milnes BS1,
Charma D. Chaussard BS4, Gail Harger MS2, Katherine Farrow BS2, Debra Bass MS2,
Tim D. Fennell MS1, Hai Hu PhD6, Thomas C. Krivak MD4, Rajiv Dhir MD3, Robert
P. Edwards MD4, Larry Maxwell MD5 Michael J. Becich MD PhD1&3,
1Department
of Biomedical Informatics and 2Department of Epidemiology, University of
Pittsburgh, PA. USA. 3Department of Pathology and 4Department of Obstetrics Gynecology,
Magee-Women Hospital, University of Pittsburgh Medical Center, PA USA. 5Walter Reed Army
Medical Center, Washington, DC. USA. 6Windber Research Institute Windber, PA, USA.
7Winthrop University Hospital, New York, USA
1
Introduction:
 The Gynecologic Disease Program (GDP), a.k.a Gynecologic Cancer
Center, is funded by the Department of Defense (DOD).
 Primary objective is to develop state-of-the-art capabilities in clinical
and basic science research aimed at improving screening, early
detection, prevention, and treatment of gynecologic disease.
 Collaborators: Walter Reed Army Medical Center, University of
Pittsburgh Cancer Institute, Windber Research Institute and
Georgetown Medical Center.
 At UPitt this program constitutes three cores:
 Luminex
 Proteomics
 Tissue Banking Informatics
2
Objectives:
 To create a well-characterized and standard-based biospecimen
repository for Ovarian and Endometrial malignancies.
 To facilitate the collection and transfer of well annotated
datasets to the central data warehouse (Windber Research
Institute).
 To create an internal storage system of transferred data using
the Clinical Trial Management Application (CTMA).
3
Common Data Elements
 Through the combined efforts of experts from various fields,
Common Data Elements (demographic, epidemiologic, clinical,
pathologic specimen and block annotation, genotype, follow up and
outcome) were developed for gynecological conditions.
 Common Data Elements (CDEs) allow consistency and
interoperability in biospecimen collection and better understanding
of research and experimental data.
 Major standards used to build CDEs:
 College of American Pathology (CAP) checklists
 Elements from coPath-synoptics
 North American Association of Cancer Registry (NAACR) core
elements
4
PATHOLOGY DATA
(coPATH and Synoptics – CAP Checklist)
OVARIAN TUMORS
EXTRAOVARIAN ABDOMINAL
TUMORS
ENDOMETRIAL TUMORS
OTHERS (HYPERPLASIAS)
•GDP # (for patient)
• Date of Surgery
• Surgical Procedure
• Macroscopic Attributes
- Anatomic Location
- Size
• Microscopic Attributes
- Histologic Type
- Grade (Silverberg grade
etc.)
- Perifocal reaction
- Nodal status
- Pathologic Stage
5
CANCER REGISTRY DATA ELEMENTS
(NAACCR STANDRARD)
OVARIAN TUMORS
EXTRAOVARIAN ABDOMINAL
TUMORS
ENDOMETRIAL TUMORS
•GDP # (for patient)
• Date of Surgery
• Surgical Procedure
• Clinical Staging (AJCC Staging)
• Therapy related variables
• Recurrence / Metastasis attributes
• Vital status
• Biochemical data
- Tumor markers
- Hormone receptors
OTHERS (HYPERPLASIAS)
6
Core Data Sources:
Questionnaire Data
Pathological Evaluation
(coPATH and synoptic)
Cancer Registry System
Clinical Trial Management Application
De-identified Data Export
Central Data Center (WRI)
7
Questionnaire Data:
 Pre-operative Case/Control Matching
 Postoperative Questionnaire
HOPE Ovarian
Endometrial
8
Pathology Data ( coPATH & Synoptics)
• Synoptic worksheet are implemented for use in data acquisition.
This provides a structured way of entering the diagnostic /
prognostic information for particular pathology specimen and
ultimately serves as medium for capturing and storing data for
translational research.
•
The data is stored as discrete data elements which appear as an
accession summary within the final pathology report.
• The synoptic data is manually or electronically imported into the
CTMA for linking pathological details on banked tissues.
9
UPMC Cancer Registry
 The UPMC Registry Information Services (RIS) is designed for the
collection, management and analysis of demographic, grading, staging,
treatment and progression data on cancer patients
 Primary sources for documentation are both the “paper” and “electronic”
medical records from which data is abstracted into the Cancer Registry
database by Certified Cancer Registrars.
 The entire UPMC RIS is built on the North American Association of Central
Cancer Registries (NAACCR) data standard architecture.
10
CANCER REGISTRY DATA ELEMENTS
(NAACCR STANDRARD)
Treatment
Follow Up
Recurrence
• Data imported from
Cancer Registry Tool.
• Data entry done at
source application (IMPAC)
by cancer registrars.
11
Technology:
 Clinical Trial Management Application is a web-based Java
application for managing various aspects of clinical trails, research
protocols and outcome initiatives.
 Provides an integrated set of components for managing
administrative and regulatory tracking (e.g., IRB-related issues).
 Facilitates study protocol schema, treatment calendar and financial
management issues for capturing study specific patient
information.
 Used for internal storage of data in GDP study.
12
13
Result:
 At UPMC (Magee Women’s Hospital) total number of consented
patients is 109. Endometrial cancer (45), ovarian cancer (13),
withdrew from the project (33), benign conditions (12), and the rest
are pending for surgical procedures.
 Patient privacy protection is of utmost importance and is enforced in
accordance with Health Insurance Portability and Accountability Act
(HIPAA) and only de-identified is transferred to WRI
 Well-structured study protocol is implemented for the acquisition of
high quality and well characterized annotated biospecimens and
transfer of data to central data warehouse (WRI).
14
Overall Workflow for GDP
15
Overall Workflow for GDP
Patient consented by Research Nurse Coordinator
(RNC) and enrolled in GDP study.
RNC assigns a GDP Number to the patient, or case, fills out the
Paper-based and Web-based (Clinical Trials Management
Application [CTMA]) Pre-operative Case/Control Matching
Questionnaire, and collects blood sample.
16
Overall Workflow for GDP
Patient undergoes operation and surgical specimen is accessioned for
pathological examination (generating coPATH Data). Biospecimen are
stored in Magee women Hospital Tissue bank tissue bank and data is
Stored in TBINV
BENIGN
MALIGNANT
Post-operative blood
sample collected.
CASE COMPLETE
FROM GDP STUDY
17
Workflow for GDP
Hope Survey
RNC completes Paper-based and Web-based (CTMA) Post-operative questionnaire (HOPE
for Ovarian / Endometrial post-operative questionnaire) and collects blood sample.
RNC MAKES A COPY OF ENTIRE QUESTIONNAIRE DATA & SENDS THE
ORIGINAL AND THE COPY TO THE FOLLOWING:
ORIGINAL
Data Manager, Department of
Epidemiology, University of Pittsburgh
COPY
Department of Biomedical Informatics
(DBMI) and Cancer Registry Team
Scansoft
Processing
18
Workflow for GDP
Data Transfer
Transfer Electronic version of Questionnaire Data (Excel layout)
to Windber Research Institute (WRI) and send one copy to DBMI.
Scansoft CSV
Output Files
19
Workflow for GDP
CTMA Annotation
DBMI post-doctoral fellows enter pathology data into CTMA (internal use) which is
pulled into Excel layout for WRI. Medical Registry data directly pulled for transfer to
WRI (Excel layout). Tissue bank inventory data is pulled out in Excel format and
transfer to WRI
20
Workflow for GDP
Excel File Generation
Transfer Pathology and
Registry Data to WRI
21
Conclusion:
 The Gynecologic Disease Program acts as a central repository
for clinically annotated gynecological tumor tissues for the
research community.
 This tissue banking initiative provides an infrastructure of joint
multi-institutional bioinformatics network that facilitates the
sharing of clinically annotated data and high quality
biospecimens to support important research activities.
 With information gained from these research activities, we will
continue to improve screening, early detection, prevention, and
treatment of gynecologic disease.
22
Acknowledgments
Collaborators:
–
–
–
–
Walter Reed Army Medical Center
University of Pittsburgh Cancer Institute
Windber Research Institute
Georgetown Medical Center
Leadership
Dr.
Dr.
Dr.
Dr.
Ronald Herberman
Robert Edwards
Rajiv Dhir
Anil Parwani
Dr. Michael Becich
Dr. Larry Maxwell
Dr. Michael Feldman
23
THANK YOU
24