Let's talk about dark data — what it means and how to navigate it. Graphic by Miguel Tovar/University of Houston

Is it necessary to share ALL your data? Is transparency a good thing, or does it make researchers “vulnerable,” as author Nathan Schneider suggests in the Chronicle of Higher Education article “Why Researchers Shouldn’t Share All Their Data”?

Dark Data Defined

Dark data is defined as the universe of information an organization collects, processes and stores – oftentimes for compliance reasons. Dark data never makes it to the official publication part of the project. According to the Gartner Glossary, “storing and securing data typically incurs more expense (and sometimes greater risk) than value.”

This topic is reminiscent of the file drawer effect, a phenomenon in which a study’s results influence whether or not the study is published. Negative results can be just as important as confirmed hypotheses.

Publishing only positive research that supports the PI’s hypothesis, a practice known as publication bias, is arguably not good science. In an article in the Indian Journal of Anaesthesia, Priscilla Joys Nagarajan et al. wrote: “It is speculated that every significant result in the published world has 19 non-significant counterparts in file drawers.” That’s one definition of dark data.

Total Transparency

But what to do with all your excess information that did not make it to publication, most likely because of various constraints? Should everything, meaning every little tidbit, be readily available to the research community?

Schneider doesn’t think it should be. In his article, he writes that he hides some findings in a paper notebook or behind a password, and he keeps interviews and transcripts offline altogether to protect his sources.

Open-source

Open-source software communities tend to regard total transparency as inherently good. What are the advantages of total transparency? You may make connections between projects that you wouldn’t have otherwise. You can easily reproduce a peer’s experiment. You can even become more meticulous in your note-taking and experimental methods since you know it’s not private information. Similarly, journalists will recognize this thought pattern as the recent, popular call to engage in “open journalism.” Essentially, an author’s entire writing and editing process can be recorded, step by step.

TMI

This trend has led researchers to tools like Jupyter and GitHub, which are popular in open-source communities. Version histories on such platforms can detail every change that occurs along a project’s timeline. Are unorganized, excessive amounts of unpublishable data really what transparency means? Or do they confuse those looking for meaningful research that is meticulously curated?

The Big Idea

And what about the “vulnerability” claim? Sharing every edit and every new direction taken opens a scientist up to scoffers and even harassment. In industry, transparency about dark data can even extend to publishing salaries, which can feel unfair to underrepresented, marginalized populations.

In Model View Culture, Ellen Marie Dash wrote: “Let’s give safety and consent the absolute highest priority, with openness and transparency prioritized explicitly below those. This means digging deep, properly articulating in detail what problems you are trying to solve with openness and transparency, and handling them individually or in smaller groups.”

------

This article originally appeared on the University of Houston's The Big Idea. Sarah Hill, the author of this piece, is the communications manager for the UH Division of Research.


Houston university to launch artificial intelligence major, one of first in nation

BS in AI

Rice University announced this month that it plans to introduce a Bachelor of Science in AI in the fall 2025 semester.

The new degree program will be part of the university's department of computer science in the George R. Brown School of Engineering and Computing and is one of only a few like it in the country. It aims to focus on "responsible and interdisciplinary approaches to AI," according to a news release from the university.

“We are in a moment of rapid transformation driven by AI, and Rice is committed to preparing students not just to participate in that future but to shape it responsibly,” Amy Dittmar, the Howard R. Hughes Provost and executive vice president for academic affairs, said in the release. “This new major builds on our strengths in computing and education and is a vital part of our broader vision to lead in ethical AI and deliver real-world solutions across health, sustainability and resilient communities.”

John Greiner, an assistant teaching professor of computer science in Rice's online Master of Computer Science program, will serve as the new program's director. Vicente Ordóñez-Román, an associate professor of computer science, was also instrumental in developing and approving the new major.

Until now, Rice students could study AI through elective courses and an advanced degree. The new bachelor's degree program opens up deeper learning opportunities to undergrads by blending traditional engineering and math requirements with other courses on ethics and philosophy as they relate to AI.

“With the major, we’re really setting out a curriculum that makes sense as a whole,” Greiner said in the release. “We are not simply taking a collection of courses that have been created already and putting a new wrapper around them. We’re actually creating a brand new curriculum. Most of the required courses are brand new courses designed for this major.”

Students in the program will also benefit from resources through Rice’s growing AI ecosystem, like the Ken Kennedy Institute, which focuses on AI solutions and ethical AI. The university also opened its new AI-focused "innovation factory," Rice Nexus, earlier this year.

“We have been building expertise in artificial intelligence,” Ordóñez-Román added in the release. “There are people working here on natural language processing, information retrieval systems for machine learning, more theoretical machine learning, quantum machine learning. We have a lot of expertise in these areas, and I think we’re trying to leverage that strength we’re building.”

Houston biomanufacturing accelerator adds pilot plant to support scale-ups

new digs

Houston accelerator BioWell announced this month that it has taken over operations of Texas BioTechnology’s pilot plant in Richmond, Texas.

The 33,000-square-foot facility is one of the largest of its kind in the U.S. and features molecular biology labs, advanced automation, fermentation equipment and 16 dedicated benches for early-stage industrial biomanufacturing companies, according to a release from the company. It will allow BioWell to offer on-site education, workforce development, and lab training for students and workers.

BioWell and its founding company, First Bight Ventures, report that the facility should help address the industry's "scale-up bottleneck due to limited pilot- and demonstration-scale infrastructure" in the U.S.

"As a Houston-based accelerator dedicated exclusively to early-stage biomanufacturing startups, partnering with this facility was a natural and highly strategic decision for us. The site is fully operational and offers a strong platform to support biomanufacturing companies, industry leaders, and research institutions, providing critical expertise and infrastructure across a broad range of biotechnology production processes,” Veronica Breckenridge, founder of First Bight Ventures and BioWell, said in a news release.

First Bight Ventures shares that the partnership with the facility will also allow it to better support its portfolio companies and make them more attractive to future investors.

BioWell will host an open house with tours of the fermentation and lab spaces and an overview of current bioindustrial projects on Wednesday, May 28, at 10:30 a.m. and 2 p.m. RSVPs are required.

BioWell was originally funded by a $700,000 Build to Scale grant from the U.S. Economic Development Administration and launched as a virtual accelerator for bioindustrial startups. Listen to an interview with Carlos Estrada, head of venture acceleration at BioWell, here.

Ultra-fast EV charging bays coming to Waffle House locations in Texas and beyond

power breakfast

Scattered, smothered and ... charged?

Starting next year, EV drivers can connect to ultra-fast charging stations at select Waffle House locations throughout Texas, courtesy of bp pulse.

The EV arm of British energy giant bp announced a strategic partnership with the all-day breakfast chain this week. The company aims to deploy a network of 400 kW DC fast chargers with a mix of CCS and NACS connectors at Waffle House restaurants in Texas, Georgia, Florida and elsewhere in the South.

Each Waffle House site will feature six ultra-fast EV charging bays, allowing drivers to "(enjoy) Waffle House’s 24/7 amenities," the announcement reads.

“Adding an iconic landmark like Waffle House to our growing portfolio of EV charging sites is such an exciting opportunity. As an integrated energy company, bp is committed to providing efficient solutions like ultra-fast charging to support our customers’ mobility needs," Sujay Sharma, CEO of bp pulse U.S., said in a news release. "We’re building a robust network of ultra-fast chargers across the country, and this is another example of third-party collaborations enabling access to charging co-located with convenient amenities for EV drivers.”

The news comes as bp pulse continues to grow its charging network in Texas.

The company debuted its new high-speed electric vehicle charging site, known as the Gigahub, at the bp America headquarters in Houston last year. In partnership with Hertz Electrifies Houston, it also previously announced plans to install a new EV fast-charging hub at Hobby Airport. In a recent partnership with Simon Malls, bp also shared plans to install EV charging Gigahubs at The Galleria and Katy Mills Mall.

bp has previously reported that it plans to invest $1 billion in EV charging infrastructure by 2030, with $500 million invested by the end of 2025.

---

A version of this article originally appeared on EnergyCapitalHTX.com.