@Chris_Knoll, your points are well taken, and I should amend my comments to say that I that “Import Concept Set” (whether as JSON or as a concept id list) should be retained in CIRCE. But I’d like to peel back the onion a bit further so we can address some underlying questions in the coming weeks.
Are we positioning ATLAS to be a collection of applications? Or rather is it destined to be the primary OHDSI Platform application?
My bias is towards the latter, that ATLAS should be essentially an IDE for CDM-based analytics. This gets to my comment the other day regarding a “Wizard” type navigation for novice users. What matters are the functions, not their ancestral applications. Right now, CIRCE has been tasked with too much-- being a phenotype library, a phenotype designer, a concept set builder, and a cohort generator. It’s a remarkable package. But we now have the opportunity to look across all our functionality-- from all of our apps-- and think about what is the right flow to perform observational analyses. ATLAS’ evolution should reflect that flow and make it intuitive for the user.
Can we move from concept sets and cohort definitions to a true Knowledge Base?
Keeping track of assets for projects is tricky. Coders figured out a long time ago how to keep things clean by importing from libraries and tracking exactly what is being used in any given project (thanks npm install!). That is every bit as critical for OHDSI to achieve our transparency and reproducibility objectives. We’ve made great strides in transparency-- any cohort definition can be viewed so people can look through it and figure out what what defined as what. But we can do a lot better.
What is better is having a unified knowledge base of:
- Concept sets
- Clinical facts - e.g. Hypoglycemia = ([glucose in mg/dL concepts] < 70 OR [glucose in mmol/L concepts] < 3.9)
- Phenotypes - our current cohort definitions, along with metadata on performance in different environments, studies in which they are used, etc
Accepting the fact that Clinical Facts are not currently in our toolkit, I would still advocate that ATLAS should have a centralized Knowledge Base, and the first step in any “Project” is creating or selecting the assets you wish to use. ( Of course, you can always go back and edit your assets later in the project workflow.) CIRCE’s cohort definition editor is actually part of this Knowledge Base workflow, as is the Concept Set builder.
How to keep the Knowledge Base from becoming a huge mess?
I definitely catch your point about not wanting to formalize everything that goes into a cohort definition. You should be able to Import a list of concepts or paste in some JSON for a concept set. No problem. But ultimately, there is only one way for us to avoid creating a huge mess of Knowledge, and that is user context.
As we begin dealing with Auth this year, we’ve got to consider our plans for user context as well. At Regenstrief, we’ve got a massive NLP query library, but we allow users the option of sharing a definition or keeping it private. When something is good, you share it out and everyone has access. That keeps things clean and manageable while still letting people go to town on draft versions of stuff.
Sorry if I cruised beyond the core ATLAS/CIRCE scope, but I think that’s kind of the point. The old gods are dead. Let’s inventory their functions, erase their territorial boundaries, and make ATLAS the OHDSI platform to rule them all.