Download Desing of the Clinical Data Portion of a CANDA to Facilitate Review by CDER

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Design of the Clinical Data Portion of a CANDA to Facilitate Review by CDER
Kay Obenshain, SAS Institute, Inc., Cary, NC
- De
- Reports
e
IIi Protocol66~6
-=ReportS
fli Protocol 66·11
- T..,..". Report.
fa Protocol 66·18
- Graphics Reports
la Protocol 66-31
- Predeftled Expressions
H&_01
Expressions
. fa Protocol 66-34
I
~Study Groupings
Figure 1. Main Library window of SASIPH-Clinical® Software
support. Personnel from the PTC and other groups
at SAS Institute graciously participated in the design
of the current CANDA.
This paper presents the design of the clinical data
portion of a CANDA that was submitted to the
Division of Metabolic and Endocrine Drug Products
at the FDA's Center for Drug Evaluation and
Research (CDER) in 1995. SAS/PH·Clinical®
software Version 2.0 (PHC) was customized to
provide an interface to data presented in the
integrated summary of efficacy and safety (ISElISS)
sections of the New Drug Application (NDA). The
focus of this paper is on the technical aspects of
designing the PHC portion of the CANDA, referred
to here as the current CANDA.
PHC is a graphical user interface, so what is there to
design? PHC accepts multiple types of data
structures to accommodate the needs of a variety of
users. In order to allow as comprehensive a range
of data structures as possible, PHC requires data to
be "defined" to the product. PHC provides tools to
define data and to build objects for use in data
exploration. Thus, PHC can be customized
specifically for individual databases and users. The
current CANDA was designed by customizing PHC
for the ISElISS database.
INTRODUCTION
SAS/PH-Cliliical (PHC) provides tools for interactive
review of clinical trials data on a desktop computer.
PHC can be used as an in-house drug development
tool as well as for CANDA submissions.
Readers familiar with PHC know that there are many
ways to use the product. This paper presents the
use of PHC as an interface to one sponsor's data for
a single CANDA submission. It mayor may not be
relevant to another sponsor or different data from
the same sponsor.
PHC is a product of the PharmaTechnology Center
at SAS Institute (PTC). When a customer licenses a
PTC product, it is the beginning of a partnership
between the PTC and the customer. Working with
existing groups at SAS Institute, PTC offers support
services such as consulting, training, and user
Rather than present confidential data, examples in
this paper show fictitious data, protocol and variable
265
nan:'es, e~c. The purpose of this paper is to present
basl~ design elements, as clearly and concisely as
possible. Details that are important but do not playa
key role in the current CANDA design are simplified
or, in the case of functions and dictionaries, not
addressed.
description of data included in PH-Studies appears
in the "Data Files" section of this paper.
Advantages of this particular arrangement of data
include more rapid data access and increased
flexibility in data structure.
OBJECnVE
Customize SAS/PH-Clinical to allow a user to:
•
view all data presented in ISElISS tables and
listings, in chronological order
•
select any patient and follow his data through
several studies in a single view
•
subset ISElISS data to arrive at initial data of
interest
•
reproduce descriptive statistics presented in
ISElISS tables
Expressions
An expression is SAS code, represented by an
object that can be "opened" to subset data. The SAS
code is transparent to the user. As an example, the
Expression "Intent to Treat Population" consists of
the code: "ITTPOP 1". The user only sees the
name of the expression, "Intent to Treat Population."
When "Intent to Treat Population" is opened, PHC
displays data only from patients in the intent-to-treat
population.
=
Expressions were created and then saved in folders
(described below) according to the expression
category. Categories of Expressions include
Protocols, Study Groupings, Population Indicators,
Treatment Groupings, Discontinuation (Reason for),
and Adverse Event.
METHODS·
Designing the current CANDA involved:
•
building objects for data exploration
•
providing metadata (data about the clinical data)
to PHC
METHODS: BUILDING OBJECTS
Objects called studies, expressions, variable groups,
and folders were built by invoking PHC facilities to
"edit" each object Detailed descriptions of these
objects appear below.
The predefined expressions consist of SAS
operators and operands that identify individual
patients. A user can open a predefined expression
to subset ISEIISS data and arrive at particular
patients of interest.
Objects called PH-Templates are supplied as part of
PHC. A PH-Template is a mechanism for a
non-programmer to create a generic report by
running SAS® code. No PH-Template was altered in
any way for the current CANDA, and all
PH-Templates supplied with PHC were included in
the CANDA submission.
An expression may be applied "By Patient" or
"By RoW". Each predefined expression in the
current CANDA should be applied "By Patient". An
example of an expression to apply "By Row" would
consist of SAS code to identify assessments
collected multiple times per patient.
Variable Groups
A variable group was created and associated with
each PH-Study. The variable group for the study
Patient Data has three main branches: Patient
Background & Status, Efficacy, and Safety. Each
main branch has at least one additional level of
sub-branches. The Patient Background & Status
branch contains the sub-branches Population,
Treatment/Exposure, Disposition, Demographics,
Medical Conditions, and Concomitant Medications.
The Safety branch contains sub-branches Adverse
Events, Laboratory Results, and Vital Signs.
PH-Studies
A "study" in PHC is a set of data files, which mayor
may not correspond to a single clinical
study/protocol/trial. In this paper, a PHC "study" is
referred to as a PH-Study to distinguish it from a
clinical study .
In the current CANDA. each PH-Study contains data
from all clinical studies summarized in ISElISS.
Data were arranged in four PH-Studies: Patient
Data, Blood Chemistry Data, Hematology Data, and
Urinalysis Data. Only one PH-Study should be
opened at a time in the current CANDA
Folders
Folders were created in the Main Library window for
PH-Studies, expressions, and variable groups.
Sub-folders were created within folders, e.g., the
folder containing predefined expressions contained
one folder for each category of predefined
expressions (Figure 1).
Patient Data contains all ISElISS data, with a caveat
on lab data. Only main laboratory test results, in
standard units are in Patient Data. PH-Studies
Blood Chemistry Data, Hematology Data, and
Urinalysis Data contain complete laboratory data,
with results in "raw" and standard units. A detailed
266
METHODS: PROVIDING METADATA
Keys are employed to join data from different files.
An integrated database containing data presented in
As a simple example, labs and vital signs were
the ISEIISS consisted of SAS data sets. The
database was "defined" to PHC by providing data
about the data, or metadata, which determines how
PHC displays, joins, and partitions data. Definition
of the ISElISS database took place in the Study
Definition environment of PHC. Entities called data
files, keys, and variables were defined'to PHC, and
attributes were aSSigned to each entity.
collected on specific visit days, and the variables
PATIENT and VISIT exist in lab and vital sign data
sets. The keys PATIENT and VISIT can link data
from these two data sets so that labs and vital signs
can be merged and displayed side by side in a data
table by PATIENT and VISIT as if they were from the
same data file ..
Keys may also determine the level at which data
vary within a data file. As an example, in Patient
Data, in a laboratory data set, the variable DTEST
(name of lab test) was defined as a key because lab
data vary on the DTEST level within VISIT. When a
user requests a lab data table, lab test results are
automatically listed by PATIENT, VISIT, and DTEST.
Data Files
Data files defined to PHC are the SAS data sets in
the ISElISS database. The process of data
definition does not alter the data sets, and a user
cannot alter data files outside the Study Definition
environment. Each data file was assigned a function
and one or more keys.
In Patient Data, keys to joih files are PATIENT and
VISIT; and keys to determine level vary with each
file. In the lab studies, the key to join files is
PATIENT; and keys that determine level for lab files
are DTEST and VISIT. .
All data sets in the integrated database were defined
to the PH-Study Patient Data, with a caveat on lab
data described below. Some of the data sets
defined to Patient Data are also defined to the three
separate lab PH-Studies (Blood Chemistry,
Hematology, and Urinalysis). Data sets defined to
all four PH-Studies are those with patient
background & status information.
Additional functionality of keys, beyond PHC
Certain keys in the current CANDA have an
additional function that is not required by PHC, but is
an important element of the design. The keys
PATIENT and VISIT serve as protocol identifiers, as
described below.
Rather than have all lab data in one huge SAS data
set, it was decided to slice the data set both
hOrizontally and vertically before defining lab data to
PHC. Decreasing the size of data files horizontally
and/or vertically is a way to maximize PHC
performance.
In addition to its role as a PH-Study key, PATIENT
provides protocol information about each patient.
A patient could participate in a maximum of three
protocols, one of each type. The three protocol types
are short-term controlled, long-term controlled, and
long-term open label. Patients could enroll in one
(and only one) protocol of each type. A different
patient identifier was assigned in each protocol.
Thus, patients could have up to three different
patient identifiers during their progreSSion through
protocols.
Organization of data within PH-StUdies
Patient Data contains a hematology file, a blood
chemistry file, and a urinalysis file. Blood Chemistry
contains a blood chemistry file, Hematology contains
a hematology file, and Urinalysis contains a
urinalysis file. Lab files in Patient Data are a subset
of variables in the corresponding file in lab
PH-StUdies.
The PATIENT key consists of three parts:
• single letter code for first protocol entered
•
patient # in first protocol
•
single letter code for extension protocol(s)
A decode table was defined to decipher first and
extension protocol numbers. The decode table is
simply a hardcopy table, separate from PHC.
Lab data sets defined to Patient Data have the same
number of observations as those in the individual lab
PH-Studies, but Patient Data contains only the main
laboratory test results, in standard units. The lab
PH-Studies contain all lab data that is in Patient
Data, but also contain additional flag variables and
"raw" lab data.
In addition to its role as a PHC key, VISIT provides
protocol information on the VISIT level. A decode
table defines the type of study associated with
specific ranges of VISIT values. As an example,
VISIT values in the 100's pertain to short-term
controlled studies. The decode table also includes
values for special visits such as Screening and
Keys
Keys define structure in PHC
Keys are variables in the SAS data set that define
the structure of a data file in a PH-Study. Keys were
selected in order to structure the data so that it
makes sense clinically.
267
As an example, the vital signs data set, VS, contains
the "By Visit" identifier variables SC_ VS, LC_ VS,
and LO_VS. Vital sign data for patient A21 C from
visit 101 were included in analyses for protocol
66-01 only, while VS data from visit 301 were
included in analyses for protocols 66-01, 66-11, and
66-31 .
Discontinuation. The values of VISIT are assigned
so that they increase as a patient progresses
through protocols.
Variables
Attributes
. Variables were assigned attributes that determine if
they are available to a user, how they are displayed,
and how they can be used. All variables in the
ISElISS were "committed" to PHC and are available
for a user.
PATIENT
VISIT
A21C
A21C
A21C
101
201
301
_Protocol
SC VS
66-01
66-01
66-01
identifier variables_
LC VS
LO VS
66-11
66-11
66-31
The attribute "format" determines how a variable is
displayed, and "analysis type" determines how
variables are used. As an example of the latter,
ITTPOP is a numeric variable that flags patients in
the intent-ta-treat population. ITTPOP was assigned
"nominal" so that when a user
Analysis Type
requests summary statistics for ITTPOP, the only
statistic returned is "n". It would not make sense to
display mean or median ITTPOP values. ITTPOP
cannot be used for descriptive or inferential
statistics.
The "By Visit" protocol identifier variables are used
as Grouping Variables in PHC Summary Statistics,
which is analogous to "by vars" in PROC MEANS.
They are necessary to recreate cells in ISElISS
tables.
Additional functionality for variables
Certain variables in the current CANDA have an
additional function that is not required by PHC, but is
an important element of the design. Special
identifier variables were created before the data sets
were defined to PHC. These variables identify
patient number and protocol number for each type of
protocol a patient entered, and identify protocol
number(s) associated with individual observations.
All data were made available for viewing by
committing all data sets and variables. Data with
multiple records per patient are listed in ascending
order by the VISIT key, which was defired to ascend
across protocols in chronological order.
=
RESULTS
Functional requirements were achieved as follows.
Requirement: view all data presented in ISElISS
tables and listings, in chronological order
Requirement: select any patient and follow his data
through several studies in a single viewing.
Pooling data from multiple protocols into Cine
PH-Study allows the user to view all the data for one
patient in the PHC Patient Profile facility.
A naming convention was adopted to identify the
three types of protocols: SC, LC, and LO represent
short-term controlled, long-term controlled, and longterm open-label studies, respectively.
Requirement: subset ISElISS data to arrive at initial
data of interest
Variables STUDY_SC, STUDY_LC, and STUDY_LO
identify the protocol a patient participated in, where
SC, LC, and LO represent the three types of
protocols. Similar variables PATID_SC, PATID_LC
and PATID_LO identify the patient number in each
type of protocol. These protocol and patient
identifier variables are located in a data set with one
record per patient, and can be used to create
expressions to apply "By Patient". Thus, they are
referred to as "By Patient" variables.
Predefined expressions can be 'opened' in PHC to
subset data. Creation of "By Patient" identifier
variables facilitated building expressions.
Requirement: reproduce descriptive statistics
presented in ISElISS tables
Cells in ISElISS tables can be reproduced by
opening expressions to arrive at the patient
population of interest, and then requesting PHC
Summary Statistics with multiple Grouping
Variables. Creation of "By Visit" identifier variables
to use as Grouping Variables facilitated use of PHC
Summary Statistics.
Protocol identifier variables were defined on a
visit-by-visit basis for each data set that contains
multiple records per patient. These variables are
referred to as "By Visit" identifier variables, to
distinguish them from the "By Patient" identifier
variables described in the preceding paragraph.
These variables identify which of the three types of
protocols data were analyzed in.
CONCLUSION
The CANDA was installed at the FDA and a training
session was held for reviewers. A guidebook with
268
been 'uneventful' in that there have been no
complaints from the FDA as this paper goes to
press.
information specific to the current CANDA design
was written. Copies of the CANPA guidebook and
PHC documentation provided by SAS Institute were
presented to reviewers.
SAS and SAS/PH-Clinical are registered trademarks of
SAS Institute Inc. in the USA and other countries. ® indicates
USA reg istration.
Any reader who has spent time perusing medical
records knows that an 'uneventful' record is the kind
he would want to have. The current CANDA has
Other brand and product names are registered trademarks or
trademarks of their respective companies.
269