|
Data Request Form
Request Form Tips
Data Use Agreement (PDF)
|
|
A Primer of Helpful Hints on the Types of Questions That Users Should Ask When Requesting Data
Accessing large data sets can be expensive and time consuming! By structuring your data request, with assistance from your health data organization, you will avoid potential problems and maximize the usefulness of each data retrieval.
I. How many years of data will be included?
A. By Calendar Year (CY): January 1 through December 31
B. By Fiscal Year (FY)
Municipal: July 1 through June 30
Non-municipal: October 1 through September 30
C. Otherwise, requestor should define study period, if it is different
Hint: A request for two calendar years requires data pulls from three fiscal year data bases
Hint: For multi-year studies; Verify changes in hospital universe, coding methods used for each data variable requested, and other additions/deletions which may have occurred between data years
II. Which hospitals will be included in this request?
A. Are data available for each hospital you are requesting?
(e.g. Chronic, Long Term care, Rehabilitative and other non-acute hospitals may not be required to report data to any public data sources.)
B. Has each hospital reported data for the study period requested?
(i.e. Are there missing hospital data for particular months or quarters)
C. Have there been any changes in services &/or operations for each requested hospital during the study period? (An addition of a new Intensive Care Unit (ICU) or hospital merger, for example)
D. Will hospitals be identified by name will they be aggregated into comparison groups? (e.g. Teaching status, number of beds, or location)
E. Do all hospitals use the same coding methodology to report the types of illnesses and treatments seen in their facility? (i.e. ICD-9 codes are the standard for diagnoses and inpatient procedures)
III. Are there specific socio-demographic characteristics needed for this request?
A. Sex
B. Age/Birth Date
In years
In weeks, if patient is less than 1 year old
Hint: The coding of many hospital procedures may not be based on age or gender considerations important to the study (for example, breast biopsies are also done on males; Tonsil and Adenoid procedures for older adults may be handled differently than in children)
Hint: Birth date may be considered confidential and therefore unavailable for specific research, Calculated age or month/year (without day) may be available as substitute information.
C. Zip Code
Zip code ranges may be aggregated into town, county, and other specified geographic areas.
Hint: Many requestors may omit zip codes for a particular study area and may exclude residents by unintentionally including invalid zip codes and/or excluding new valid zip codes
D. Race
Coding at each hospital is usually based on individual response, appearance, or other subjective criteria and may be considered "soft data"
Hint Race categories may need to be aggregated into larger groups (e.g. black, white, other)
E. Expected Type of Payment Data (Insurance Carrier)
The Expected payment source recorded on the face sheet of the medical record may not have been the actual source of payment
Because of the method in which these data were collected there may be no way to determine if the charge for the hospital stay and/or medical services provided were covered by the principal expected source of payment; the actual payment amount may vary
F. Expected Source of Payment Data (Health Plan)
Inpatient data reporting guidelines for Massachusetts acute care hospitals, amended by the state in January 1994, now require that in addition to reporting the expected type of payment data, hospitals must also report the "expected source of payment" data
Expected source of payment data uniquely identify 159 different health plans (e.g. Aetna, HMO Blue, Tufts Health Plan, Kaiser, Medicare Plus, etc.). Each expected source of payment code has also been assigned to a corresponding expected type of payment code, which identify each of the insurance carriers, including Blue Cross and Blue Cross Managed Care, Commercial Insurance, Commercial Insurance Managed Care, HMO plus ten additional managed and non-managed care payor type categories
Hint: For each "expected source of payment" code reported, consider analyzing whether the corresponding data by reported "expected type of payment" code matches the same payor type assignments defined by the state, to verify the accuracy and reliability of the various payor data reported
Hint: Consider aggregating carriers into broader categories (e.g. aggregating all Blue Cross non-managed carriers and all Blue Cross managed care carriers into Blue Cross)
IV. Other Clinical Data Set Elements
A. Length of Hospital Stay (L.O.S.)
Based on the sum of gross patient days during one hospitalization. However, some hospitals do include "Leave of Absence Days" as part of its length of stay calculations
a. Leave of Absence Days are the number of days of a patient's absence with physician approval during a hospital stay without formal discharge and readmission to the facility
Hint: Since not all hospitals code Leave of Absence Days, calculate length of stay using Net Days, as gross Days minus Leave of Absence Days
Hint: Researchers should decide whether they want to exclude records with lengths of stay of 0 days or lengths of stay greater than 999 days, &/or assign these cases to a length of stay based on predetermined criteria
Hint: For length of stay "outliers," i.e. patients with extreme length of stay, consider using "trimmed data;" Patients with lengths of stay longer than other patients with similar clinical characteristics and treatments may be excluded from average length of stay calculations because the inclusion of these cases will tend to inappropriately inflate these averages. These "outlier cases" can be the result of such factors as coding errors, patients awaiting nursing home placement, physician practices or an unusual set of circumstances. "Trimming" is a technique which reduces the variability in the data (Consult with technical staff for "trimming" methodologies)
B. Discharge Disposition Status
Patient status at the time of discharge from the hospital (i.e. patient sent home, patient sent to another hospital, long term or rehabilitation facility for additional (post-acute) services, patient expired during hospitalization)
C. Type of Admission: Indicates the priority status of the admission
D. Hospital Inpatient Admission Source: The source referring or transferring the patient to inpatient status in the hospital (For Emergency, Elective or Other Type of Admission)
Caveat: Definitions and assignment of these variables may be unique to each facility reporting information
Caveat: Information regarding these variables may be unavailable at the time the patient is discharged and therefore may be understated
Hint: Broad category definitions and data validation using other data sources should be considered as these data variables may be considered to be "soft data" by some hospitals
E. Admission Date/Discharge Date
These data may be considered confidential and therefore unavailable for specific data research
Day of the week, and month/year may be available as substitute information
F. Diagnoses
Principal Diagnosis Code: The ICD-9-CM (5 digit) diagnosis code corresponding to the condition established to be chiefly responsible for the admission of the patient for hospital care.
Associated Diagnosis Codes (up to 14): Conditions that co-exist with the principal diagnosis at the time of admission, or develop subsequently, which affect the treatment received or the length of the patient's hospital stay
Hint: Be sure to define your diagnostic criteria fully; For example, newborns may be classified using ICD-9 diagnosis codes V37-V37.2 & V39-V39.2, however, other newborn diagnoses include weight (coded in grams) and/or by using a fifth digit, in addition to the four digit diagnosis code. Newborns can also be defined using DRG
G. Procedures
Principal Procedure Code: The ICD-9-CM (4 digit) procedure code corresponding to the procedure most related to the principal diagnosis and performed for definitive treatment of the principal diagnosis rather than for diagnostic or exploratory purposes, or was necessary to treat a complication of the principal diagnosis
Associated Procedure Code (up to 14): Corresponds to the additional procedures performed during the length of the patient's hospital stay.
H. Diagnosis Related Groups (DRGs)
A software program established as the basis for the Medicare hospital reimbursement system which aggregates all diagnoses and procedures coded during a patient's hospital stay. (HCFA usually revises the DRG categories annually, for each data year there are at least 495 inpatient DRGs.)
The DRGs are a patient classification scheme which provides a means of relating the type of patients a hospital treats to the costs incurred by the hospital
Hint: Since HCFA could change DRGs annually, multi-year studies should include a matrix of DRG revisions, including both old and new DRGs
Hint: Consider other, more meaningful, aggregations based on your study criteria. (e.g. Aggregating by hospital department; Dividing Cardiology into Cardiac Medicine and Cardiac Surgery)
V. Charges
A. Total Charges: Thew full undiscounted charges summarized by prescribed revenue center codes. The total charges should not include charges for telephone services, television or private duty nurses. Any charges for a leave of absence period are to be included in the routine accommodation charges for the appropriate services (medical/surgical, psychiatry) from which the patient took the leave of absence. Any routine admission charges or daily charges under which expenses are allocated to the routine, special care & ancillary revenue centers are included in the total charges
Caveat: Charges are not costs nor payments
Hint: Calculate median as well as average charges when using charge data
Hint: Researchers should consider whether they want to exclude records with charges of $0 dollars, or records with missing or erroneous financial data (e.g. Per Diem charges less than $100) &/or assign those cases to a charge, based on predetermined criteria
Hint: Consider using criteria to "Trim" &/or exclude cases with extremely high charges; Patients high charges (outlier cases) may be the result of such factors as coding errors, physician practices or an unusual set of circumstances. Trimming charges is a technique based on predetermined criteria which may reduce the variability when using these data. (Consult with technical staff for "trimming" methodologies)
VI. Unique Physician Number (UPN)
A. Attending Physician Number
The unique number assigned to the physician primarily responsible for the care of the patient from the beginning of the hospital episode. The patient's personal physician who arranged for the patient's admission and directed the patient's care is normally designated the attending physician.
If the patient does not have a person physician, the hospital staff physician to whom the patient is assigned and who is responsible for the patient's care is the attending physician.
B. Operating Physician Number
The unique number assigned to the physician responsible for performing the principal procedure on the patient.
Caveat: UPN data are based on the actual license number of each attending &/or operating physician reported in the data base. However, UPN data are encrypted by the state prior to public release, so that the identity of an individual patient may not be directly or indirectly identified.
Caveat: Since UPN data are encrypted, these data cannot be used to determine the name of the actual attending &/or operating physician.
For more information on MHDC Data, please contact us
via e-mail or at (781) 419-7800. We welcome your further questions & look forward to your participation in our work and our events!
|
|