wiki:GROUSE

Greater Plains Collaborative Reusable Observable Unified Study Environment (GROUSE)

GROUSE is our project to obtain health insurance claims from the Center for Medicare and Medicaid Services CMS through the Research Data Assistance Center ResDAC at the University of Minnesota.

Using GROUSE for Analysis

in progress; see #623 and these slides:

access, regulatory issues...

tools ...

CMS Data

i2b2 Data

Eventually, we plan to provide one integrated i2b2 with both the CMS data and the site data.

Meanwhile...

#323
GROUSE query by days supply, dx_source, and other PCORNet modifiers
#595
collect several i2b2 datamarts for GROUSE
#618
collect WISC i2b2 repository copy for GROUSE
#644
collect remaining i2b2 datamarts for GROUSE

PCORNet CDM Data

Eventually, we plan to provide one integrated CDM with both the CMS data and the site data.

Meanwhile...

#619
GROUSE: integrate CDM data from GPC sites
#647
Send copy of CDM to KUMC for Cancer RCR / GROUSE integration testing

Data Integration and Record Linkage

from executive summary

Agreements

We will be organizing our technical milestones on the roadmap (e.g. milestone:grouse-research-1) but also have preliminary thoughts under CompleteData.

Photo by Bob Gress, Birds in Focus.

Development: Data Staging, ETL

HackathonFour

Design Sketches, Usage Scenarios, Customers

In a 9 Aug meeting, we (at KUMC) identified 7 usage scenarios. We did some more detailed planning on the first few in a 29 Sep meeting.

Note CancerClaimsPilot

other customers:

  • IU ALS and DVT
  • Anne B. at UMN

1 CMS Files, de-identified

needs:

#623
training for GROUSE SQL queries
#646
SAS GUI for GROUSE Analysts

customers:

  • Mary S. from UIOWA on
    • CancerRCR Aim 3
    • pilot project(s) which ones, exactly?
  • Dr. Peggy Pessig is studying adverse drug events at MCRF.

The de-identified files (tables) have been prepared, using grouse/cms_deid.

Geocoding Integration

scheduled for Jan 2018 in an Oct 4, 2017 KUMC planning meeting

Note consensus in ​​ticket:508 on using obfuscated geographic location codes.

pilot 26, Project 3:

... Merge census-tract-level sociodemographic information derived from the American Community Survey and the 2010 Census Summary File to study cohort. Hypotheses: Differences in chemotherapy delays or discontinuation by race, ethnicity, or other sociodemographic characteristics will not be explained by differences in hospitalizations or other evidence of complications. Most patients will receive chemotherapy and subsequent treatments for complications from a single institution.

2 ETL CMS Files to CDM

CancerRCR#Aim3 is a customer; Jan 1 sync point: test SAS code.

source code: cms_i2p module and surrounding code.

Alternative option:

3 ETL CMS Files to i2b2

source code: grouse/etl_i2b2

KUMC:ticket:4481

4 KUMC CDM + CMS CDM

KUH pop health

  • ticket:526 finder file
    • and subsequent tickets to get the crosswalk file(s), since the scope of #526 has been narrowed:
    • #581: task: Crosswalk files for the remaining 4 GPC sites back from GDIT (new)
    • #564: task: Crosswalk files from 6 sites back from GDIT (closed: fixed)
  • ticket:??? create offset file: hash, pat_num, site_days_offset; establish master_days_offset

5 KUMC i2b2 + CMS i2b2

KUH pop health breast cancer: tumor registry (RW, SS)

6 big CDM (CMS + many sites)

  • #619 integrate CDM data from GPC sites

CancerRCR#Aim3 relies on this.

Replace 12 GPC popmednet nodes with big CDM?

7 big i2b2 (CMS + many sites)

  • #595 collect i2b2 datamarts
  • #597: spec for crosswalk to accompany i2b2 datamarts for GROUSE

obesity Davis @ KU Larry at WISC. Joan Neuner MCW

breast cancer: tumor registry (EC and co)

8 Provider Availability by County (potential supplemental data integration)

Check out federal datasets that measure the amount of health care providers available at the county level. Also see the materials and videos on RESDAC from Beth Virnig for ideas.

Last modified 7 weeks ago Last modified on Jan 10, 2018 8:13:08 PM

Attachments (8)