What is a phenotype in the context of observational research?

apotvien · May 21, 2019, 3:24pm

Hello all,

In order to move forward in a way that is precise and in agreement, we’ve begun a glossary of terms relevant to the phenotype library. So far, we’ve established the following definitions. Do others agree or disagree with these wordings?

Phenotype - as it pertains to observational research, an observable set of characteristics in health data about an organism. A phenotype’s purpose is the desired intent to identify members in health data with the observed set of characteristics of interest. The observable set of characteristics can include conditions, procedures, exposures, devices, observations, etc.

Phenotype Algorithm = Cohort Definition - is a coded set of instructions for best approximating the desired intent of identifying members of a phenotype. Defines a set of members in health data who satisfy one or more criteria for a duration of time. Each phenotype could have one or more phenotype algorithms (e.g. T2DM broad, T2DM narrow). The instructions could be rule-based (heuristic) or computable (probabilistic). Heuristic-based phenotype algorithms consist of rules and one or more concept sets. Probabilistic phenotypes are implemented using a predictive model.

Cohort - instantiation of a phenotype algorithm

Gold Standard Phenotype - a “Gold Standard” phenotype is one that is designed, evaluated, and documented with best practices.

Also, are there others you would like to see added to the glossary (e.g. Concept Set)?

Mark_Danese · May 22, 2019, 2:06am

I will add a few comments to the above discussion because we have spent some time sorting through this. The horse may be out of the barn, so to speak, but I thought it would be good to document some other opinions on these ideas.

The use of the word “phenotype” is confusing. See this paper from JAMA where it is used differently and means an actual “clinical phenotype”. https://jamanetwork.com/journals/jama/fullarticle/2733996 As Christian mentioned, I also think about people using hospitalizations, devices, labs, and costs – “phenotype” doesn’t really extend to these ideas in any natural way.

There is already a good word used for this idea – “concept” (or “research concept” if you prefer). That is why the code list is called the “concept set”. So, if algorithm is the implementation, then it is the implementation of the concept, not the phenotype.

Separating out “concept set” from algorithm doesn’t make sense. The concept set is the fundamental building block of the algorithm. In fact, it is the only required element for an algorithm. The hierarchy is that a concept set is part of an algorithm, which is part of a cohort, which is part of a protocol. Each level potentially adds additional information that only needs to be specified at that level.

Finally, the definition of a cohort should not include “followed for a period of time”. Follow-up is part of the protocol/study. I can use a single cohort with different durations of follow-up for different purposes (time to hospitalization, death, treatment, etc.). This is not just my opinion – look up the definition of cohort in Rothman’s Modern Epidemiology. (I don’t have my copy handy, but it is essentially a group of people at a given time with no mention of follow up requirements).

Sorry to hijack the thread – I know Aaron is trying to move forward, but these words are the building blocks and it is probably worth discussion on the terminology. The hardest job is almost always naming something.

Patrick_Ryan · May 22, 2019, 2:43am

‘Persons hospitalized due to pneumonia’, ‘Persons with a cardiac pacemaker’, ‘Persons with elevated hemoglobin A1c values’, these are examples of ‘phenotypes’ that firmly fit within @hripcsa’s definition of ‘a specification of an observable, potentially changing state of an organism’. Each of these phenotypes can be operationalized with a phenotype algorithm (e.g. #1 - looking for an inpatient visit and a condition occurrence of ‘pneumonia’ between the visit start date and visit end date; #2 - looking for device exposure of ‘pacemaker’; #3 - looking for measurement of ‘hemoglobin A1c’ with unit of % and value > 7.5). And, aligned with OHDSI’s definition of a cohort, each phenotype algorithm can result in a set of persons who satisfy the inclusion criteria for a duration of time.
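
To make the three rules above concrete, here is a minimal Python sketch run against a handful of made-up events. It is only an illustration under stated assumptions: the `Event` record, the string labels standing in for concept sets, and the function names are all hypothetical, and none of this is how ATLAS or any OHDSI tool implements a cohort definition. It also returns only the persons who qualify and leaves out cohort entry and exit dates.

```python
from dataclasses import dataclass
from datetime import date
from typing import List, Optional


@dataclass
class Event:
    """One toy health-data record (hypothetical structure, not the OMOP CDM)."""
    person_id: int
    domain: str                    # "visit", "condition", "device" or "measurement"
    concept: str                   # plain label standing in for a concept set
    start: date
    end: Optional[date] = None
    value: Optional[float] = None
    unit: Optional[str] = None


def hospitalized_for_pneumonia(events: List[Event]) -> List[int]:
    """#1: inpatient visit with a pneumonia condition between visit start and end."""
    visits = [e for e in events if e.domain == "visit" and e.concept == "inpatient"]
    conditions = [e for e in events if e.domain == "condition" and e.concept == "pneumonia"]
    return sorted({
        v.person_id
        for v in visits
        for c in conditions
        if c.person_id == v.person_id
        and v.end is not None
        and v.start <= c.start <= v.end
    })


def has_pacemaker(events: List[Event]) -> List[int]:
    """#2: any device exposure labelled 'pacemaker'."""
    return sorted({e.person_id for e in events
                   if e.domain == "device" and e.concept == "pacemaker"})


def elevated_hba1c(events: List[Event], threshold: float = 7.5) -> List[int]:
    """#3: hemoglobin A1c measurement in % with a value above the threshold."""
    return sorted({e.person_id for e in events
                   if e.domain == "measurement"
                   and e.concept == "hemoglobin A1c"
                   and e.unit == "%"
                   and e.value is not None
                   and e.value > threshold})


# Made-up records for five persons.
EVENTS = [
    Event(1, "visit", "inpatient", date(2019, 1, 1), date(2019, 1, 10)),
    Event(1, "condition", "pneumonia", date(2019, 1, 3)),
    Event(2, "visit", "outpatient", date(2019, 2, 1), date(2019, 2, 1)),
    Event(2, "condition", "pneumonia", date(2019, 2, 1)),
    Event(3, "device", "pacemaker", date(2018, 6, 15)),
    Event(4, "measurement", "hemoglobin A1c", date(2019, 3, 1), value=8.1, unit="%"),
    Event(5, "measurement", "hemoglobin A1c", date(2019, 3, 1), value=6.9, unit="%"),
]
```

Person 2 has a pneumonia diagnosis but no inpatient visit, so rule #1 does not pick them up, which is the kind of difference between a concept set and a phenotype algorithm discussed in this thread.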

Please let’s not conflate the word “concept”, which has a very clear and literal meaning throughout the OHDSI community and forms the basis of the entire vocabulary structure. Concepts are not phenotypes, and a conceptset does not materialize into a list of persons. A cohort definition requires inclusion criteria and specification of cohort entry and exit; while inclusion criteria may use conceptsets as one dimension of the logic, it’s important to recognize that other domain-specific attributes and temporal logic also contribute to criteria, so they are not synonymous.

We cannot ignore the dimension of time in a cohort definition, because the phenotype is an ‘observable, potentially changing state’. The phenotype of ‘persons hospitalized for pneumonia’ starts when they are admitted and ends when they are discharged. The phenotype of ‘persons with pacemaker’ starts when the device is first implanted, and persists until the device is removed. The phenotype of ‘persons with elevated HbA1c’ requires some specification of when the elevations start and stop. But it is important to highlight, just because a cohort definition requires a cohort entry and cohort exit, that does NOT mean that you are in any way bound to these dates for any analysis. In fact, ALL current OHDSI standardized analytics tools for characterization, population-level effect estimation, and patient-level prediction specify a ‘time at risk’, which can be disjoint from the cohort start and cohort end. It is quite common, when a cohort represents an outcome, that we are most interested in only the cohort start (e.g. in a time-to-event analysis) and make no use of cohort end whatsoever. In GWAS studies, we only require the binary classification of a person belonging to a phenotype (without regard to either cohort start or cohort end), because we pre-suppose that the genetic association observed is temporal insofar as the genes preceded the trait. But in a self-controlled case series, it is important to know the durations for which a person is ‘exposed’ and ‘unexposed’ to whatever phenotype is under study throughout an observation period, so the notion of cohort start and end can be quite helpful in that context. All of this is to state, we should not conflate analysis ‘follow-up’ time with ‘cohort’ time, but we should also not dismiss the reality that a cohort does have a temporal component (whether it’s used or not).

@Mark_Danese said he didn’t have it handy, but I happen to have my copy of Rothman’s Modern Epidemiology just 16 inches from my keyboard, so I thought it useful to clear the record of what’s actually defined there. Rothman does use different words than us (and other epidemiology textbooks), but is clear that he is simply providing a language convention that he is using throughout his textbook.

Page 38: “The term population as we use it here has an intrinsically temporal and potentially dynamic element: One can be a member at one time, not a member at a later time, a member again, and so on…The term cohort is sometimes used to describe any study population, but we reserve it for a more narrow concept, that of a group of persons for whom membership is defined in a permanent fashion, or a population in which membership is determined entirely by a single defining event and so becomes permanent…”.

So, in Rothman’s parlance, we have a broader notion of ‘cohort’ that is fully consistent with his definition of ‘population’, but in this book, he chooses to restrict ‘cohort’ to a limited subset of what is covered by our definition. Note here also though, cohort is defined in the context of a ‘cohort study’ (as it’s done in many epidemiology textbooks with varying definitions). But in our context, we are not limiting the use of a cohort to a ‘cohort study’, and quite the contrary, we are using cohorts as fundamental building blocks to all our analyses, including characterization, estimation and prediction. This is exactly why we’ve needed to disambiguate the notion of ‘analysis follow-up time’ from ‘cohort membership time’, and why the OHDSI tools accommodate the range of options using both notions of time to meet the needs of the researchers.

krfeeney · May 22, 2019, 3:17am

I love Ken and all… and think this entire topic is easily the biggest controversy since the Kentucky Derby winner. In the interest of helping the fearless @apotvien come to a point of consensus, it may make sense to turn a synopsis of the competing points of view presented here into a Google Form (aka a digital ballot) that can be democratically voted on like other CDM convention proposals. If we’re not already using Git issues, I could see that lending itself well to tracking the component build-out that’s being discussed.

I have heard rumors if you put a name field on these Google forms, Anna Karenina characters are known to troll OHDSI community proposals but I suspect a manual chart review could validate true versus nefarious votes.

Don’t mean to debate but as someone who desperately wants to consume this phenotype library, I selfishly would love to see us move towards action. We have so many network studies that could help test and validate these approaches.

hripcsa · May 22, 2019, 3:41am

A few comments on the naming, although not a solution.

First on phenotype. The original biological use was the visible manifestation of the genome, possibly modified by the environment. Manifestations brought on solely by the environment were not relevant to those researchers and were not called phenotypes. The term got generalized to more general clinical manifestations, with the previously mentioned JAMA paper serving as an example. There is no particular reason that the output of the classification algorithm had to pick genome-related differences. In that paper, it more or less means pathophysiologic (or physiologic) subdivisions of groups of patients (or healthy persons). Informatics generalized it further (and specialized it) to mean any definable property, usually derived from some kind of electronic data. The definition in my paper starts with the JAMA paper use and explains how it is used (and modified) in informatics.

In OHDSI, cohort is not a bad term, but we spend a lot of time saying “cohort,” and people say “what?” and we say “oh a phenotype” and they say “oh ok.” So we have leaned that way.

I think the big generalization in informatics is using phenotype for things that are not the person’s fault, like how long they were observed in the database. In effect, the informatics view treats the electronic recording as the primary object, and anything recorded there is fair game. So a given study may want only people with a phenotype of having been observed for at least a year.

I would stay away from reusing the word “concept” to refer to these definitions. I do not think that epidemiologists, clinical researchers, biostatisticians, informaticians, etc. would understand that the word “concept” includes logic and criteria for a clinical state. It just doesn’t feel right to say I have two related concepts, one with diabetic patients aged 40 and over, and the other with such patients aged 41 and over. It feels like one concept with slightly different criteria. Informaticians use the word “concept” in the context of ontologies, so I would just keep it there. And “concept set” is simply a set of those concepts. Not a phenotype or cohort. And a cohort does not require a concept. E.g., the cohort of all patients in the database.

We won’t go to “population” as I think that will add to the confusion.

I think we are stuck with “phenotype,” explaining to some that it is a cohort, or “cohort” explaining to others that it is a phenotype. Probably not worth trying to come up with a formal distinction between those two.

Christian_Reich · May 22, 2019, 7:55am

Friends:

Since I was the one to open the debate about these definitions, let me propose a synthesis of all that was said to make @apotvien’s life easier. I think we have a good grasp of the elements we want to define, but we still have nomenclature problems with the term Cohort:

Phenotype: A pattern of characteristics in health data (criteria) in a set of people for a duration of time. These observables can be conditions, procedures, drug exposures, devices, observations, visits, cost information, etc.

I think that “pattern” is better than “set”, because it indicates a relationship between the observables or criteria (insulin-dependent diabetic: Patient with the Condition diabetes mellitus and being treated with a drug containing insulin).

Phenotype Algorithm = Cohort Definition: A coded set of instructions for approximating a phenotype in a given dataset, which may or may not have complete and accurate evidence about each of the observables and their pattern. Each phenotype can have one or more phenotype algorithms (e.g. T2DM broad, T2DM narrow). The instructions could be heuristic (rule-based) or probabilistic. Heuristic algorithms consist of rules applied to concept sets. Probabilistic phenotypes are implemented using a probabilistic model.

This is similar to @apotvien’s definition, except there is no more desire involved (desires could be a good thing, but not in the context of these abstract definitions), and that the algorithm doesn’t define members, but rules. And that heuristic rules are also computable, so I took that out. And that the model is probabilistic, not predictive.

Now we need to name the actual instantiated set of members identified through execution of the algorithm. We can (i) call that Cohort, or (ii) we can make Cohort a synonym for Phenotype and call this Cohort Instance. The former means Cohort is the ideal desired pattern of things (insulin-dependent diabetics), the latter denotes an actual set of people and the timelines a certain algorithm or definition has calculated in a database (cohort 123 in database XYZ).

(i) I actually like the idea to use the terms interchangeably. Reason is the avoidance of confusion. Folks who have a hard time calling a drug or device exposure a phenotype can call that a cohort and be happy. Folks who have a hard time calling an outcome a cohort, which is a lot of our traditional epidemiologist friends, can call that a phenotype and also be happy. If we want to be really nice we might even include Rothman’s Population as well. I don’t have a strong feeling about that.

(ii) This is how we have used the word Cohort mostly, ATLAS calls it that way (even though the nomenclature in the ATLAS UI badly needs overhauling), and @apotvien et al. proposed it.

Anyway. Whatever we decide:

Cohort/Phenotype Instance (i) or Cohort (ii): An instantiation or execution of the instructions of a Phenotype Algorithm/Cohort Definition against a dataset, resulting in a set of patients and their timelines.

I agree with @Patrick_Ryan that Concept is not a term we want here. Concepts are semantic entities representing medical events or facts, and they are needed for those algorithms.

Now, we still have the precious metals. @apotvien has a Gold Standard Phenotype as “one that is designed, evaluated, and documented with best practices.” What is the “one” thing here? What does it apply to: A Phenotype, as @apotvien has it? Can’t be, because that is an intended ideal we need to approximate, which means, all of them are Gold. A Phenotype Algorithm? Can’t be, because the evaluation and documentation depends on an instantiation. A Cohort (Instance)? That would be the right thing, except it makes it totally not transferable, and therefore practically useless.

Also, we want Gold. Do we also want to take on Silver? Something that is not fully validated against some truth (the “chart”), but only probabilistically? Bronze - something we pull out of a sleeve after chewing the pencil and scratching our foreheads for a while (which is what 99.9% of all published phenotypes are today)?

Please help.

apotvien · May 22, 2019, 1:32pm

Thank you. All attempts to make my life easier are welcomed.

Christian_Reich:

Now, we still have the precious metals. @apotvien has a Gold Standard Phenotype as “one that is designed, evaluated, and documented with best practices.” What is the “one” thing here? What does it apply to: A Phenotype, as @apotvien has it? Can’t be, because that is an intended ideal we need to approximate, which means, all of them are Gold. A Phenotype Algorithm? Can’t be, because the evaluation and documentation depends on an instantiation. A Cohort (Instance)? That would be the right thing, except it makes it totally not transferable, and therefore practically useless.

Also, we want Gold. Do we also want to take on Silver? Something that is not fully validated against some truth (the “chart”), but only probabilistically? Bronze - something we pull out of a sleeve after chewing the pencil and scratching our foreheads for a while (which is what 99.9% of all published phenotypes are today)?

By “Gold” here, I mean phenotype algorithm, because that’s what every entry in the library will be.

Let me try to frame this with a cooking analogy.

Suppose we wish to make chicken noodle soup. When we imagine what that looks like, we’re all thinking about roughly the same thing. However, when it comes down to the details about how to make a chicken noodle soup, there are a plethora of recipes out there. Even if two people follow an identical recipe, they may get different results. Before moving on, the notion of chicken noodle soup is the phenotype, a particular chicken noodle soup recipe is a phenotype algorithm, and an actual pot of hot chicken noodle soup sitting on the kitchen table (mmm…) is the cohort (an instance of an applied recipe).

Now, there are two ways such an applied recipe can fail: 1) The recipe itself is inherently bad; maybe it leaves out the noodles and calls for the chicken to remain raw, and 2) The recipe was not followed by the cook; it calls for ingredients not in the cook’s pantry so the cook left those things out, and it turned out poorly.

Turning back to our library, I think this highlights that validation relies on both the authors and the validators alike. The author needs to be given the opportunity to lay out all of the pieces (ingredients) required to successfully implement their proposed phenotype algorithm. Likewise, a validator is obligated to report metrics only if they followed the author’s stated instructions and intended use.

If that contract is met, we should be able to automatically discern high quality phenotypes over time as they are validated, much like seeing a recipe with multiple 5-star reviews. The notion of “Gold Standard” refers to the idea that the phenotype algorithm went through an agreed upon process to be admitted into the library, but it doesn’t pass judgements about the performance characteristics. The notion of what’s acceptable will vary from case to case and person to person – It’s subjective, just like who we believe has the very best chicken noodle soup recipe.

tlasky · May 22, 2019, 1:42pm

This has been a rich and fascinating discussion. I recognize that this will be archived here, but am wondering if it could be synthesized and summarized, perhaps in the form of an article. While OHDSI will develop its own definition, the thinking can inform other groups who are undergoing similar processes, and can also serve as instructional material, helping to educate all of us on the many aspects of phenotypes (or whatever term is used).

rjking · May 22, 2019, 2:25pm

Hi all,

I wanted to share a nice paper on the topic to add to the discussion - https://rethinkingclinicaltrials.org/resources/ehr-phenotyping/. Aligns with your discussion. Not suggesting it over any of your current definitions where different - I’m a new fly on the wall - just sharing.

Best,
Ray (Epidemiologist/Informaticist at CDC)

Mary_Regina_Boland · May 22, 2019, 4:28pm

I think it’s important to note that:
EHR ‘Phenotypes’ are at best an approximation of the ‘True Phenotype of Individual’
‘Phenotyping algorithms’ are methods that allow for the approximation of an EHR phenotype

Therefore in terms of quality/accurateness:
‘Phenotype as defined by an EHR Phenotyping Algorithm’ < ‘EHR Phenotype’ < ‘True Phenotype’

The reason I make this distinction is because most EHR phenotyping algorithms have some level of accuracy - say 95%. This accuracy is typically determined by comparing the phenotypes generated from the algorithm with data also in the EHR (be it notes, structured data, etc.). Very rarely do we recruit patients and then determine their ‘true phenotype’ and compare against the ‘EHR phenotype’ and then the ‘phenotype from the EHR phenotyping algorithm’. This is an important point because often insurance status affects whether certain tests are performed (i.e., the EHR data that informs our algorithms), which thereby affects the EHR phenotype generated. If a patient has never been tested for a disease they will likely not have the EHR data for that disease - although in truth they may have the disease.
I discuss some of these issues in a 2013 JAMIA paper:
https://academic.oup.com/jamia/article/20/e2/e232/709983
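
As a rough illustration of what "comparing the phenotypes generated from the algorithm with data also in the EHR" can look like, the sketch below scores an algorithm's yes/no labels against a reference set of labels (for example, from chart review). The function name and the toy labels are made up for this example; the point is only that a single "accuracy" figure hides sensitivity and positive predictive value, and that all three are measured against the EHR-based reference rather than against the person's true state.

```python
from typing import Dict, Optional


def agreement_metrics(algorithm_labels: Dict[int, bool],
                      reference_labels: Dict[int, bool]) -> Dict[str, Optional[float]]:
    """Compare algorithm output with reference labels for the same persons.

    Persons missing from algorithm_labels are treated as 'not in the phenotype'.
    A metric is None when its denominator is zero.
    """
    tp = fp = fn = tn = 0
    for person_id, ref in reference_labels.items():
        alg = algorithm_labels.get(person_id, False)
        if alg and ref:
            tp += 1
        elif alg and not ref:
            fp += 1
        elif not alg and ref:
            fn += 1
        else:
            tn += 1

    def ratio(num: int, den: int) -> Optional[float]:
        return num / den if den else None

    return {
        "sensitivity": ratio(tp, tp + fn),   # share of reference cases the algorithm finds
        "ppv": ratio(tp, tp + fp),           # share of algorithm cases confirmed by reference
        "accuracy": ratio(tp + tn, tp + fp + fn + tn),
    }


# Made-up labels for five persons (True = has the phenotype).
REFERENCE = {1: True, 2: True, 3: False, 4: False, 5: True}
ALGORITHM = {1: True, 2: False, 3: False, 4: True, 5: True}
METRICS = agreement_metrics(ALGORITHM, REFERENCE)
```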

At best all we can hope for is high quality ‘EHR Phenotype’ information - we cannot capture ‘True Phenotypes’. This addresses some of people’s concerns over the ‘gold standard’ terminology. I would say that accurate ‘EHR phenotype’ information is a gold standard while the ‘true phenotype’ is a platinum standard - amazing if you can get it, but very hard to acquire and also very rare.
Therefore, algorithms that approximate EHR phenotypes are ‘silver standards’, which is consistent with how the term ‘silver standard’ is used in the field as well.

Mary_Regina_Boland · May 22, 2019, 4:41pm

To make my definitions a little more organized:
Platinum Standard: The True Phenotype of the person
Gold Standard: True EHR Phenotype (this should be non-institution dependent - therefore it should not be based on how your specific institution has coded diabetes, but that diabetes was coded in the EHR)
Silver Standard: Phenotype inferred from Phenotyping Algorithm Applied to EHR data (could vary by institution). The gold standard should be based on definitions that increase the accuracy of the silver standards across institutions

Some examples:
1.) Homeless person with diabetes, goes to hospital because injured in an accident. No one tests for diabetes - no EHR data on diabetes.
Platinum standard: diabetes (impossible to capture by algorithms b/c data on diabetes does not exist in EHR)
Gold standard: no diabetes
Silver standard: no diabetes

2.) Person with diabetes, goes to hospital and is coded with one diabetes code, they then lose insurance and stop being treated for diabetes at that institution
Platinum standard: diabetes (could be possible to capture if you lower the threshold to include 1 diabetes code presence in EHR, but this will also increase the false positive rate)
Gold standard: no diabetes (will only be listed as diabetes if the definition is expanded to include patients with 1 diabetes code)
Silver standard: diabetes (if you define at your particular institution to include all patients with any diabetes code) - this could have high accuracy at your institution, but unlikely to generalize across institutions. The generalizable phenotype definitions should be considered ‘gold’

ericaVoss · May 22, 2019, 6:27pm

We had an internal meeting where we discussed this with our broader department. We landed on something almost identical to what @Christian_Reich wrote above, so I’ll just merge the two.

Phenotype
A phenotype, as it pertains to observational research, is a pattern of observable characteristics in health data for a set of people for a duration of time. These characteristics can include conditions, procedures, drug exposures, devices, observations, visits, cost information, etc.

Phenotype Algorithm = Cohort Definition
A phenotype algorithm is a coded set of instructions with the desired intent of identifying members of a phenotype in health data. Each phenotype could have one or more phenotype algorithms (e.g. T2DM broad, T2DM narrow). The instructions could be heuristic (rule-based) or probabilistic. A heuristic-based phenotype algorithm consists of rules and one or more concept sets. A probabilistic phenotype algorithm is implemented using a probabilistic model.

Cohort or Phenotype Instance
A cohort instance or phenotype instance is a set of patients for a duration of time that results from the execution of phenotype algorithm instructions against health data.
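
One way to picture how the three terms relate is the small Python sketch below. It is only an illustration under assumptions: the class names, the "at least n diabetes codes" rule, and the choice of cohort start and end dates are all hypothetical and are not taken from ATLAS or the phenotype library. One phenotype (the idea of T2DM) gets two phenotype algorithms (broad and narrow), and running each against the same toy data yields a different cohort instance.

```python
from dataclasses import dataclass
from datetime import date
from typing import Callable, Dict, List


@dataclass(frozen=True)
class Phenotype:
    """The intended pattern; carries no executable logic."""
    name: str
    description: str


@dataclass(frozen=True)
class CohortEntry:
    """One person's membership period in a cohort instance."""
    person_id: int
    cohort_start: date
    cohort_end: date


# Toy health data: person_id -> dates on which a diabetes code was recorded.
HealthData = Dict[int, List[date]]


@dataclass(frozen=True)
class PhenotypeAlgorithm:
    """Coded instructions that approximate a phenotype in a dataset."""
    phenotype: Phenotype
    label: str
    rule: Callable[[HealthData], List[CohortEntry]]


def at_least_n_codes(n: int) -> Callable[[HealthData], List[CohortEntry]]:
    """Heuristic rule: qualify on the n-th code.

    Assumption for this sketch: entry is the n-th code date and exit is the
    last code date. A real definition would choose these deliberately.
    """
    def rule(data: HealthData) -> List[CohortEntry]:
        entries = []
        for person_id, dates in sorted(data.items()):
            ordered = sorted(dates)
            if len(ordered) >= n:
                entries.append(CohortEntry(person_id, ordered[n - 1], ordered[-1]))
        return entries
    return rule


def instantiate(algorithm: PhenotypeAlgorithm, data: HealthData) -> List[CohortEntry]:
    """Executing an algorithm against data yields a cohort (phenotype instance)."""
    return algorithm.rule(data)


T2DM = Phenotype("T2DM", "Persons with type 2 diabetes mellitus")
T2DM_BROAD = PhenotypeAlgorithm(T2DM, "T2DM broad", at_least_n_codes(1))
T2DM_NARROW = PhenotypeAlgorithm(T2DM, "T2DM narrow", at_least_n_codes(2))

DATA: HealthData = {
    1: [date(2018, 1, 5), date(2018, 6, 1), date(2019, 2, 1)],
    2: [date(2018, 3, 10)],
    3: [],
}

BROAD_COHORT = instantiate(T2DM_BROAD, DATA)
NARROW_COHORT = instantiate(T2DM_NARROW, DATA)
```

Running the same two algorithms against a different database would give different cohort instances again, which is the distinction between the algorithm and its instantiation.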

Again I welcome people to challenge these and provide feedback.

Many thanks to @Frank for help with wordsmithing.

####################################################

@apotvien I think we should at least capture our definitions. Maybe on the WIKI for the Phenotype Library.

Mark_Danese · May 22, 2019, 6:46pm

Just trying to understand how to use this proposed language.

Let’s say I want to conduct a study of cardiovascular outcomes in new statin patients. I want to use first statin use to define the index date, I want to include hypertension as an inclusion criterion, I want to use congestive heart failure as an exclusion criterion, and I want a number of baseline exposures for my multivariate model (age, sex, hyperlipidemia diagnosis, family history of cardiovascular disease, LDL cholesterol, and number of hospitalizations in the last year). My outcome is death, myocardial infarction or stroke. (Not a complete study of course.)

In my world, each of those pieces (study variables) is operationalized with an algorithm. The cohort is the group of people who meet the index, inclusion, and exclusion criteria. The follow up time is time until end of observation, death, myocardial infarction, or stroke. I might have different follow up times for the cohort if I want to look at overall mortality only (i.e., ignoring the cardiovascular event outcomes).

Is the phenotype the intersection of all of the inclusion and exclusion criteria? Or are each of those criteria independent phenotypes? Or is this the cohort?

Same question with baseline variables and outcomes – are these separate phenotypes? Or is the phenotype the sum of all of the pieces of my study (and most people would have slightly different phenotypes)?

Chris_Knoll · May 22, 2019, 9:17pm

I’ll give it a shot to translate those pieces into OHDSI:

T (target cohort): new statin patients.
O (Outcome cohort): 1 or more cohorts identified by the phenotype algorithm that represents the cardiovascular outcome of interest.

include hypertension = At least 1 occurrence of diagnosis of {hypertension concept set} (note: you didn’t include any time window information here. Recent diagnosis, any prior diagnosis? within 1 year?)
exclude congestive heart failure: Exactly 0 occurrence of diagnosis of {CHF concept set} (note: no time window specified, when should these CHF events appear that would exclude? 10 years ago means they are excluded? only within a year?)
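
To show why the missing time window matters, here is a small Python sketch of the two criteria. Everything in it is an assumption made for illustration: the function names, the lists of dates standing in for {hypertension concept set} and {CHF concept set} hits, and the 365-day window, which the original study description does not specify.

```python
from datetime import date
from typing import List, Optional


def count_in_window(event_dates: List[date],
                    index_date: date,
                    lookback_days: Optional[int] = None) -> int:
    """Count events on or before index_date.

    lookback_days=None means 'any time prior'; otherwise only events within
    that many days before the index date are counted.
    """
    count = 0
    for d in event_dates:
        if d > index_date:
            continue
        if lookback_days is not None and (index_date - d).days > lookback_days:
            continue
        count += 1
    return count


def qualifies(index_date: date,
              hypertension_dates: List[date],
              chf_dates: List[date],
              lookback_days: Optional[int] = None) -> bool:
    """At least 1 hypertension diagnosis AND exactly 0 CHF diagnoses in the window."""
    return (count_in_window(hypertension_dates, index_date, lookback_days) >= 1
            and count_in_window(chf_dates, index_date, lookback_days) == 0)


# Toy person: first statin on 2019-05-01, hypertension coded six months earlier,
# CHF coded ten years earlier.
INDEX = date(2019, 5, 1)
HYPERTENSION = [date(2018, 11, 1)]
CHF = [date(2009, 5, 1)]

ANY_PRIOR = qualifies(INDEX, HYPERTENSION, CHF)                      # CHF excludes them
WITHIN_ONE_YEAR = qualifies(INDEX, HYPERTENSION, CHF, lookback_days=365)  # CHF is too old
```

The same person is excluded under "any prior diagnosis" and included under "within 1 year", so the window has to be part of the definition.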

I’m not sure I follow the terminology of ‘exposures’ here, but it sounds like you are describing ‘features’ of the population. If you are saying that you require a certain age, sex, family history, etc. for the person to qualify for the cohort, then it’s inclusion criteria. If it is a ‘feature’ of the population after you’ve identified your new statin users, then it’s not part of the cohort definition. It’s a covariate that you can include in your model (which are extracted via Feature Extraction or some other mechanism).

Those are three different cohorts: one Outcome (O) cohort per outcome of interest, each identified by its own phenotype algorithm.

Cohort definitions are not concerned with the followup time that is specific to your study, or which outcomes of interest may end this followup time. When you define your new statin user cohort, the question is: how long are they present in the cohort? Are they new users just for that day? Do you let them qualify as a ‘new user’ for a fixed duration after their initial exposure? Do you consider the first stretch of continuous exposure to the statin the period of time that they are new? These are all decisions that go into your cohort definition to define when the person enters the cohort and when the person should leave. The simplest definition would say the person enters the cohort at the first statin exposure, and they are considered ‘new users’ for all time after the first exposure.

On the other hand, when you are defining your study, you establish what you want to use as your follow up time (or what we sometimes call Time At Risk). This time window can be based off of the person’s cohort_start, cohort_end, or something in-between. Depends on what you want to do. Did you want to study the risk of an outcome during the 6 months after a patient ends their exposure to statin? Start with a cohort where the start-end represents exposure, and then your follow-up/time-at-risk is from the cohort end date to 183 days after cohort end date.
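
The separation between cohort dates and time at risk can be sketched in a few lines of Python. This is a minimal illustration, not the OHDSI tools' API: the function and parameter names are hypothetical, and the cohort dates are made up.

```python
from datetime import date, timedelta
from typing import Tuple


def time_at_risk(cohort_start: date,
                 cohort_end: date,
                 start_anchor: str = "cohort_start",
                 start_offset_days: int = 0,
                 end_anchor: str = "cohort_end",
                 end_offset_days: int = 0) -> Tuple[date, date]:
    """Derive a study-specific time-at-risk window from a cohort period.

    Each end of the window is anchored on either the cohort start or the
    cohort end and shifted by a number of days.
    """
    anchors = {"cohort_start": cohort_start, "cohort_end": cohort_end}
    tar_start = anchors[start_anchor] + timedelta(days=start_offset_days)
    tar_end = anchors[end_anchor] + timedelta(days=end_offset_days)
    if tar_end < tar_start:
        raise ValueError("time at risk ends before it starts")
    return tar_start, tar_end


# Toy cohort period representing one person's statin exposure.
EXPOSURE_START = date(2019, 1, 1)
EXPOSURE_END = date(2019, 3, 1)

# Risk during exposure: the window coincides with the cohort period.
ON_TREATMENT = time_at_risk(EXPOSURE_START, EXPOSURE_END)

# Risk in the 6 months after exposure ends: cohort end to 183 days after cohort end.
POST_EXPOSURE = time_at_risk(EXPOSURE_START, EXPOSURE_END,
                             start_anchor="cohort_end",
                             end_anchor="cohort_end",
                             end_offset_days=183)
```

The same cohort supports both windows, so the choice of follow-up stays in the study design and does not change the cohort definition.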

I don’t think each of the elements of the phenotype are themselves a phenotype. A concept set is not a phenotype, and a single criterion may not be enough either (eg: cohort of men? What’s the start/end of that statement?)

The phenotype is an intersection of inclusion and exclusion rules, but more than that, it is also the specification of how long the people should be considered part of the phenotype. This is not follow-up. Follow-up is a study-specific choice that you make when designing a study (are we looking at short term risks or long term risks). You could consider the cohort_start to cohort_end as the time at risk period, but that’s just another decision you make when designing your study.

I think you could use a phenotype to identify a baseline variable (ie: they were in a phenotype within 30d of baseline/index). But not all baseline variables would be a phenotype. The number of Inpatient visits in the past 6 months is a baseline variable. It is not a phenotype.

-Chris

Mary_Regina_Boland · May 22, 2019, 11:31pm

The problem with Christian’s definition of phenotype as “A pattern of characteristics in health data (criteria)” is that fundamentally you have inclusion and exclusion criteria. The algorithm includes or excludes from the phenotype. I think the issue others are having on this thread is that we all fundamentally know in our gut that inclusion and exclusion criteria do not make sense in the context of a phenotype in the sense of what is ‘diabetes’. It’s possible that you will fail the inclusion criteria and yet still have the phenotype. If by ‘phenotype’ you mean a set of exclusion or inclusion criteria then that would be a cohort but not a phenotype. The words are not really interchangeable.
The definition of cohort (from wikipedia) is:
“In statistics, marketing and demography, a cohort is a group of subjects who share a defining characteristic.”

Therefore ‘death’ is a characteristic - a group of people who died are a cohort. Death is not really a phenotype in the truest sense of the word. Death within a certain timeframe is an important outcome or characteristic of interest. An outcome is often the result of a Phenotype + Exposure, but can also be the result of the disease or Phenotype itself. It’s important not to confuse these terms.
It might simplify things if there was an outcome, disease and an exposure library where all of the cohorts were defined. It will be easier for people to understand what everything is. Otherwise, if you call everything a ‘phenotype’ it will be really confusing.

hripcsa · May 23, 2019, 2:01pm

I think there is just an ambiguity in the term phenotype and we can decide how to deal with it. Richesson, who is linked to above, highlights the ambiguity in first defining “phenotype” and then defining “EHR-based phenotype definitions, or simply phenotypes.” That JAMA paper is also a good example. There is no concept (no chicken soup) underlying a dimension reduction on observable variables to classify a disease into 4 subtypes with slightly different outcomes. There are an infinite number of ways to divide a disease such that patients have slightly different outcomes within groups. There may be a hope that there is an underlying biological truth to it, but that is not clear. It doesn’t seem that much better than an arbitrary classification based on EHR data.

For those classification phenotypes (where the word “phenotype” was approved by JAMA editors), failure of the inclusion criteria does in fact exclude you from the phenotype because there is no underlying truth to the classification to compare to. Either you fit the criteria or not.

I am good with the @ericaVoss definition but I do agree with others that while phenotypes change over time, there is no need to define a phenotype as having an implicit duration (“for a duration of time”). Instead I think it is important to emphasize that the characteristics are measured in time. And you have to be explicit whether you want simultaneous characteristics or not. That is, we want to push people to account for time in their queries, but not assign a specific duration of time that a phenotype has to last.

Mary_Regina_Boland · May 23, 2019, 3:28pm

Ultimately I think it depends on how clear you want the word ‘phenotype’ to be within the OHDSI framework. Obviously it’s up to you guys and you can call exposures, outcomes and diseases all phenotypes if you like - in a certain sense they each would fall within your definition of phenotype, because the characteristics of a person who decides to take a particular medication could be a ‘phenotype’ regardless of disease or outcome. However, if everything is a ‘phenotype’ it muddies the waters and makes it less clear what the definition means. I think it would make more sense to keep labels such as exposure, outcome, and disease - it would be a lot clearer to people what is being talked about. If everything is called a phenotype it will be more confusing - again, up to you.
Some phenotypes have implicit durations (e.g., pregnancy) and others are permanent (e.g., death) - but you could leave it up to the cohort builders to represent and model their exposures, outcomes and diseases appropriately - this would increase flexibility.

Mark_Danese · May 23, 2019, 6:37pm

I agree with @Mary_Regina_Boland. We have a lot of good words for everything already. Cohort, index date, inclusion, exclusion, baseline exposure/variable, outcome.

I think part of the struggle is what word to use to describe the implementation. When I say “diabetes is an inclusion criterion” there isn’t ambiguity at a conceptual level about what I am doing in my protocol. But the details of my implementation of “diabetes” are not clear. The implementation can be called an algorithm, definition, criterion, phenotype, computable phenotype, etc. I don’t love “phenotype” or “computable phenotype” to describe the implementation, but that is just my opinion.

I think using “phenotype” to describe a study cohort (the result of all inclusion and exclusion criteria) is even more confusing. Especially since we already have the word “cohort”.

My issue seems to be that the OHDSI term “cohort” is used slightly differently in the context of a protocol than I use it as part of my research. So, I will not belabor the point except to say that it might help to clarify this for others. (Thanks to @Chris_Knoll for going into so much detail on the implementation of a study using the OHDSI framework – that helped me understand that I was essentially using a different language.)

krfeeney · May 23, 2019, 11:26pm

When you put books in a library do you categorize them for leisure reading versus assigned school reading? Is a library not a place where the book exists and you, as a consumer, add the meaning of how the book is used?

A book is a book is a book. No?

We’re assembling a collection of books. Lest we forget, books are not capable of reading themselves. Distinguishing whether I read Homer’s Odyssey for summer fun or because it was mandated for a course creates a level of intricacy that isn’t a library’s role.

Mark_Danese · May 24, 2019, 12:16am

That isn’t quite the analogy being made. Imagine we have a very good word like “book”. Then somebody else advocates that we use the word “file” instead. Because, after all, all books nowadays are created, stored, and accessed on computers as some kind of file. Hence, “file” is a better word to use.

In my opinion, book = algorithm, and phenotype = file. Phenotype isn’t “wrong”. But it has a very specific meaning in the context of our field of health and medicine – the set of observable characteristics of an individual resulting from the interaction of its genotype with the environment. Why use that word?