Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Redesign mapping file structure #3

Open
mbaudis opened this issue Nov 5, 2018 · 0 comments
Open

Redesign mapping file structure #3

mbaudis opened this issue Nov 5, 2018 · 0 comments
Assignees

Comments

@mbaudis
Copy link
Collaborator

mbaudis commented Nov 5, 2018

The current mapping file structure is based on the assumption of a primary use of icdom::icdot codes as the unique source keys. As we know, some of the mapping targets are more specific than any existing icdom::icdot combination.

I suggest to make this format more general, with the indication of what primary key is being used.
This would then also allow to have a way of making arbitrary compositions of primary keys, and even avoiding the specification of "primary" and "equivalents". However, this would require the addition of an information attribute to samples which indicates the original code attribution (this could default to icdom::icdot if not specified).

Current example:
equivalents:
  - label: Ductal Breast Carcinoma
    id: ncit:C4017
  - id: seer:26000
examples:
  - label: invasive breast adenocarcinoma
input:
  - label: Infiltrating duct carcinoma, NOS
    id: icdom:85003
  - label: Breast, NOS
    id: icdot:C50.9
New version:
pattern: "icdom::icdot"
examples:
  - label: "invasive breast adenocarcinoma"
close_matches:
  - label: Infiltrating duct carcinoma, NOS
    id: icdom:85003
  - label: Breast, NOS
    id: icdot:C50.9
  - label: Ductal Breast Carcinoma
    id: ncit:C4017
  - id: seer:26000

This now would be the specification of a sample with a primary assignment by NCIt, and the derived ICD and SEER codes:

pattern: "ncit"
examples:
  - label: "breast carcinoma [from DCIS, basal-like, triple negative]"
close_matches:
  - label: Infiltrating duct carcinoma, NOS
    id: icdom:85003
  - label: Breast, NOS
    id: icdot:C50.9
  - label: Triple-Negative Breast Carcinoma
    id: ncit:C71732
  - id: seer:26000
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants