Let's talk about dark data — what it means and how to navigate it. Graphic byMiguel Tovar/University of Houston

Is it necessary to share ALL your data? Is transparency a good thing or does it make researchers “vulnerable,” as author Nathan Schneider suggests in the Chronicle of Higher Education article, “Why Researchers Shouldn’t Share All Their Data.”

Dark Data Defined

Dark data is defined as the universe of information an organization collects, processes and stores – oftentimes for compliance reasons. Dark data never makes it to the official publication part of the project. According to the Gartner Glossary, “storing and securing data typically incurs more expense (and sometimes greater risk) than value.”

This topic is reminiscent of the file drawer effect, a phenomenon which reflects the influence of the results of a study on whether or not the study is published. Negative results can be just as important as hypotheses that are proven.

Publication bias and the need to only publish positive research that supports the PI’s hypothesis, it can be argued, is not good science. According to an article in the Indian Journal of Anaesthesia, authors Priscilla Joys Nagarajan, et al., wrote: “It is speculated that every significant result in the published world has 19 non-significant counterparts in file drawers.” That’s one definition of dark data.

Total Transparency

But what to do with all your excess information that did not make it to publication, most likely because of various constraints? Should everything, meaning every little tidbit, be readily available to the research community?

Schneider doesn’t think it should be. In his article, he writes that he hides some findings in a paper notebook or behind a password, and he keeps interviews and transcripts offline altogether to protect his sources.

Open-source

Open-source software communities tend to regard total transparency as inherently good. What are the advantages of total transparency? You may make connections between projects that you wouldn’t have otherwise. You can easily reproduce a peer’s experiment. You can even become more meticulous in your note-taking and experimental methods since you know it’s not private information. Similarly, journalists will recognize this thought pattern as the recent, popular call to engage in “open journalism.” Essentially, an author’s entire writing and editing process can be recorded, step by step.

TMI

This trend has led researchers to open-source programs like Jupyter and GitHub. Open-source programs detail every change that occurs along a project’s timeline. Is unorganized, excessive amounts of unpublishable data really what transparency means? Or does it confuse those looking for meaningful research that is meticulously curated?

The Big Idea

And what about the “vulnerability” claim? Sharing every edit and every new direction taken opens a scientist up to scoffers and harassment, even. Dark data in industry even involves publishing salaries, which can feel unfair to underrepresented, marginalized populations.

In Model View Culture, Ellen Marie Dash wrote: “Let’s give safety and consent the absolute highest priority, with openness and transparency prioritized explicitly below those. This means digging deep, properly articulating in detail what problems you are trying to solve with openness and transparency, and handling them individually or in smaller groups.”

------

This article originally appeared on the University of Houston's The Big Idea. Sarah Hill, the author of this piece, is the communications manager for the UH Division of Research.

Ad Placement 300x100
Ad Placement 300x600

CultureMap Emails are Awesome

Innovative Houston nonprofit partners with county organization to provide maternal health services

TEAM WORK

PUSH Birth Partners, a Houston-based maternal health nonprofit, is teaming up with the Harris County Public Health Department to provide doula services for over 200 pregnant people free of cost.

Jacqueline McLeeland, CEO and founder of PUSH, says the program will begin in August and aims to improve maternal health and birth outcomes for vulnerable populations. McLeeland says the organization has built up a strong doula training program through their collective in partnership with March of Dimes and several local doula organizations.

McLeeland says PUSH aims to address poor maternal health outcomes for women of color in part by training more doulas of color who can help reduce racial disparities in care. A 2021 study by Harris County Public Health found Precinct 1, which is predominantly composed of people of color, had the highest maternal mortality rate of the county.

Through their collective, PUSH has trained two cohorts of doulas through an integrated care model, focused on providing collaborative care with medical providers in the healthcare system.

“Our programs are designed to advance health equity, we see the numbers, we see that women of color, specifically Black women in that group are disproportionately impacted,” McLeeland tells InnovationMap.

After receiving a $100,000 grant from the Episcopal Health Foundation in 2023, PUSH began their doula expansion program in Houston and they have since received an additional grant from EHF for the next fiscal year. McLeeland shares PUSH has also launched a pilot program called Blossoming Beyond Birth, sponsored by the Rockwell Fund, targeted towards improving maternal mental health through weekly support groups in Houston.

“It’s very exciting to know that we have come this far from where we started and to see how everything is coming together,” McLeeland shares.

Jacqueline McLeeland serves as chief executive and founder of non-profit PUSH Birth Partners who has trained and collaborated with a network of doulas for the partnership. Photo courtesy of Jacqueline McLeeland

For McLeeland, improving maternal health outcomes and providing support to people experiencing high-risk pregnancies are deeply personal goals. McLeeland has sickle cell anemia, a condition that can cause serious complications during pregnancy. During her first pregnancy in 2015, McLeeland was placed on bed rest two months before her due date at which point she had been working in clinical research within the pharmaceutical industry for over 12 years.

“People don’t realize the magnitude of what women go through, during pregnancy and after,” McLeeland says. “There’s a lot of emotional, psychological, and physical tolls depending on how the pregnancy and delivery went.”

After giving birth to her first child, McLeeland took maternity leave, during which she began to research maternal morbidity and mortality trends, information which she says was not widely discussed at the time.

McLeeland says entering the maternal healthcare field felt like a necessity following her second pregnancy. Several months after giving birth to her second child, McLeeland says she received a bill for a surgical procedure that was performed during her cesarean section without her or her husband’s consent. McLeeland says that was the first time she was made aware of the surgery.

“The procedure that was claimed to have been performed could have put my life in jeopardy by hemorrhaging based off of additional research I did once, I came across that information,” McLeeland explains. “These are some of the things that happen in the healthcare system that make people skeptical of trusting in the healthcare system, trusting in doctors.”

McLeeland says the key to improving maternal and birth outcomes for vulnerable populations is to encourage the partnership between doulas, community healthcare workers, and physicians and hopes to further this collaboration through future programming.

Houston-based clean energy site developer raises $300M to decarbonize big tech projects

fresh funding

Houston energy executives have started a new company dedicated to developing clean-powered infrastructure for the large electric loads.

Cloverleaf Infrastructure, dually headquartered in Houston and Seattle, Washington, announced its launch and $300 million raised from NGP and Sandbrook Capital, two private equity firms. The company's management team also invested in the company.

As emerging technology continues to grow electricity load demand, Cloverleaf has identified an opportunity to develop large-scale digital infrastructure sites powered by low-carbon electricity.

"The rapid growth in demand for electricity to power cloud computing and artificial intelligence poses a major climate risk if fueled by high-emission fossil fuels," David Berry, Cloverleaf's CEO, says in a news release. "However, it's also a major opportunity to catalyze the modernization of the US grid and the transition to a smarter and more sustainable electricity system through a novel approach to development.

"Cloverleaf is committed to making this vision a reality with the support of leading climate investors like Sandbrook and NGP."

Berry, who's based in Houston, previously co-founded and served as CFO at ConnectGen and Clean Line Energy Partners, clean energy and transmission developers. Last year, he co-founded Cloverleaf with Seattle-based Brian Janous and CTO Jonathan Abebe, who most recently held a senior role at the United States Department of Energy. Nur Bernhardt, director of Energy Strategy at Microsoft who's also based in Seattle, rounds out the executive team as vice president.

"The large tech companies have become dominant players in the electricity sector, and they are genuinely determined to power their growth with the lowest possible emissions," Janous, who serves as chief commercial officer, says in the release. "Achieving this objective doesn't depend on disruptive new technologies as much as it does on dedicated teams working hand in hand with utility partners to maximize the use of the clean generation, storage, and other technologies we already have."

Cloverleaf will work with regional U.S. utilities and data center operators to provide clean electricity at scale through strategic investments in transmission, grid interconnection, land, onsite power generation, and electricity storage, per the release.

"The sustainable development of digital infrastructure at scale is fundamentally a technical power problem," Alfredo Marti, partner at Sandbrook, adds. "We have witnessed members of the Cloverleaf team effectively address this challenge for many years through a blend of creativity, specialized engineering, a partnership mindset, and astute capital deployment."

------

This article originally ran on EnergyCapital.

Houston resilience tech innovator proves out platform amid Hurricane Beryl

HOUSTON INNOVATORS PODCAST EPISODE 245

Earlier this month, Ali Mostafavi got an unexpected chance to pilot his company's data-backed and artificial intelligence-powered platform — all while weathering one of Houston's most impactful storms.

Mostafavi, a civil and environmental engineering professor at Texas A&M University, founded Resilitix.AI two years ago, and with the help of his lab at A&M, has created a platform that brings publicly available data into AI algorithms to provide its partners near-real time information in storm settings.

As Hurricane Beryl came ashore with Houston on its path, Mostafavi says he had the opportunity to both test his technology and provide valuable information to his community during the storm.

"We were in the process of fine tuning some of our methods and algorithms behind our technology," Mostafavi says on the Houston Innovators Podcast. "When disasters happen, you go to activation mode. We put our technology development and R&D efforts on hold and try to test our technology in an operational setting."

The platform provides its partners — right now, those include local and state organizations and emergency response teams — information on evacuation reports, street flooding, and even damage sustained based on satellite imagery. Mostafavi says that during Beryl, users were wondering how citizens were faring amid rising temperatures and power outages. The Resilitix team quickly pivoted to apply algorithms to hospital data to see which neighborhoods were experiencing high volumes of patients.

"We had the ability to innovate on the spot," Mostafavi says, adding that his own lack of power and internet was an additional challenge for the company. "When an event happens, we start receiving requests and questions. ... We had to be agile and adapt our methods to be responsive. Then at the same time, because we haven't tested it, we have to verify that we are confident (in the information we provide)."

On the episode, Mostafavi shares how Hurricane Harvey — which occurred shortly after Mostafavi moved to Houston — inspired the foundation of Resilitix and how Houston is the ideal spot to grow the company.

"We are very excited that our company is Houston based," he says. "We should not be just ground zero of disasters. We have to also be ground zero for solutions as well. I believe Houston should be the hub for resilience tech innovation as it is for energy transition.

"I think energy transition, climatetech, energy tech, and disaster tech go hand in hand," Mostafavi continues. "I feel that we are in the right place."