The Future of Health Data Management: Enabling a Trusted Research Environment
Increased access to health research data allows scientists and researchers to uncover findings about diseases and treatments that were previously out of their reach. This data, based on genomic markers, is a key ingredient in developing medicines and diagnosing patients.
In a recent study, researchers at Stanford University set a world record by diagnosing a patient with a rare disease in five hours and two minutes. By contrast, a typical rare-disease diagnosis can take up to four years, and children typically must wait six to eight years before being diagnosed.
Shortening the timeline for diagnosis is clearly a critical factor in living a longer and healthier life.
The hurdle to speeding the pace of diagnosis is that health data is often held and accessed by a single group or organization (“silos,” in other words), and patient confidentiality makes data-sharing problematic. To overcome this hurdle, researchers and organizations are leaning into a relatively new method of health data management, by establishing trusted research environments (TREs).
TRE is becoming a commonly used acronym among the science and research community. In general, a TRE is a centralized
This is a very different approach from the traditional ways in which researchers access data. Historically, researchers have had to download an entire dataset onto their own computer before they could study it. Transferring and releasing data in this way increases the risk of security problems, even when the individuals in the dataset have been de-identified. Furthermore, this method takes a considerable amount of time that could be better spent on analysis of clinical data sets.
Why the shift?
The COVID-19 pandemic revealed that the availability and standardization of patient clinical data were key to learning more about the virus and how to target it head-on. Researchers from all over the world were running experiments, analyzing their findings, collecting clinical data sets, and reporting on their outcomes.
During this time, organizations became more aware of the pressing need for a new way to manage health data. Specifically, the UK Health Security Agency started collecting whole genome sequencing data from COVID-infected patients back in 2020. Recently the agency has just passed
Global impact of limited access
TREs are becoming the architectural backbone for health data in many research organizations. While this is a step in the right direction, many TREs still can’t communicate with TREs at other organizations, or even with those in other departments of the same organization.
For example, some universities have several research departments, each with its own TRE. It is unfortunately common for TREs that are only a wall apart within an organization to be unable to “speak” to one another. Without this ability, it is impossible to take full advantage of a TRE.
As the genomic sector continues to grow, the capability of TREs to
That doesn’t mean moving data. Life sciences data sets are too large to move efficiently and, to complicate matters, many data security regulations forbid data from leaving an organization, state or nation. Consequently, it is estimated that as much as 80–90 percent of important datasets are simply unavailable to researchers.
What is required is a shift from data centralization in silos to a means for allowing data to be shared while in situ with the organizations that gathered it in the first place. No alternative is as promising for research.
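One way to picture analysis of data in situ is the minimal Python sketch below. It is an illustration under stated assumptions, not a description of any particular TRE product: the `Site` class, the `count_where` method and the `federated_prevalence` function are hypothetical names invented for this example. Each site keeps its own records and answers a question with a count only, so the rows themselves never move.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

Record = Dict[str, object]


@dataclass
class Site:
    """Hypothetical data holder. Its records are never returned to callers."""
    name: str
    _records: List[Record]

    def count_where(self, predicate: Callable[[Record], bool]) -> int:
        # Only an aggregate count leaves the site, never individual rows.
        return sum(1 for record in self._records if predicate(record))

    def total(self) -> int:
        # Number of records held at this site.
        return len(self._records)


def federated_prevalence(sites: List[Site],
                         predicate: Callable[[Record], bool]) -> float:
    """Combine per-site counts into one overall proportion."""
    matches = sum(site.count_where(predicate) for site in sites)
    total = sum(site.total() for site in sites)
    return matches / total if total else 0.0
```

A real deployment would also need authentication, auditing and disclosure checks on the counts that come back (very small counts can identify individuals); those steps are left out here to keep the sketch short.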
What constitutes a trusted research environment?
There are several factors that organizations need to consider when they set out to develop a trusted research environment.
1. Safe people
Users need to be approved and hold the appropriate credentials to access the health data. They must not attempt to re-identify patients, as that would be a breach of patient confidentiality, nor give another party access through their credentials. Researchers and scientists must also be properly trained in using the TRE platform.
2. Safe projects
Because TREs hold sensitive information, it is essential that the data is used only for relevant projects that benefit public health. To achieve this, TREs must have auditing in place to ensure compliance.
3. Safe setting
The cloud technology behind a TRE should never let data leave the database or export any findings to users. Researchers should be able to bring in their own algorithms for analysis, but any tools that are brought into the system must first be held in “airlock” mode. This feature allows tools to be scanned so that the security of the TRE is not affected. Ensuring safe setting
4. Safe data
Data within the TRE must be secure, with patients de-identified and no possibility of researchers re-identifying them. The data must also be cleaned and verified so that it is of good quality and relevant to the approved project. Safe data can open up new research opportunities that will benefit the general public.
5. Safe outputs
As mentioned in Safe setting
When a TRE meets all five of these requirements, the organization has enabled a fully trusted research environment.
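The five requirements can be read as a checklist that every access request has to pass. The Python sketch below is illustrative only: the `AccessRequest` fields and the `failed_safes` function are hypothetical names, and reducing each requirement to a single yes/no flag is a simplifying assumption, since a real TRE would back each one with its own approval process.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class AccessRequest:
    """Hypothetical summary of one research request (all fields assumed)."""
    user_accredited: bool       # 1. Safe people: approved, trained user
    project_approved: bool      # 2. Safe projects: relevant, public benefit
    tools_passed_airlock: bool  # 3. Safe setting: imported tools were scanned
    data_deidentified: bool     # 4. Safe data: de-identified and verified
    outputs_reviewed: bool      # 5. Safe outputs: assumed to mean a review step


# Pair each requirement's name with the flag that represents it.
CHECKS = [
    ("safe people", "user_accredited"),
    ("safe projects", "project_approved"),
    ("safe setting", "tools_passed_airlock"),
    ("safe data", "data_deidentified"),
    ("safe outputs", "outputs_reviewed"),
]


def failed_safes(request: AccessRequest) -> List[str]:
    """Return the names of the requirements this request does not meet."""
    return [label for label, field in CHECKS if not getattr(request, field)]


def is_trusted(request: AccessRequest) -> bool:
    """A request is acceptable only when all five requirements are met."""
    return not failed_safes(request)
```

Returning the list of failed requirements, not just a yes or no, gives an auditor something concrete to record when a request is turned down.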
Genomic health data brings unique challenges when it comes to storage, management, analysis and collaboration, due both to the scale of the datasets and the sensitivity of what’s contained in them. TREs are becoming the architectural structure to bridge the gap for health data so that the information can be scaled and secured.