Let's talk about dark data — what it means and how to navigate it. Graphic by Miguel Tovar/University of Houston

Is it necessary to share ALL your data? Is transparency a good thing or does it make researchers “vulnerable,” as author Nathan Schneider suggests in the Chronicle of Higher Education article, “Why Researchers Shouldn’t Share All Their Data.”

Dark Data Defined

Dark data is defined as the universe of information an organization collects, processes and stores – oftentimes for compliance reasons. Dark data never makes it to the official publication part of the project. According to the Gartner Glossary, “storing and securing data typically incurs more expense (and sometimes greater risk) than value.”

This topic is reminiscent of the file drawer effect, a phenomenon which reflects the influence of the results of a study on whether or not the study is published. Negative results can be just as important as hypotheses that are proven.

Publication bias and the need to only publish positive research that supports the PI’s hypothesis, it can be argued, is not good science. According to an article in the Indian Journal of Anaesthesia, authors Priscilla Joys Nagarajan, et al., wrote: “It is speculated that every significant result in the published world has 19 non-significant counterparts in file drawers.” That’s one definition of dark data.

Total Transparency

But what to do with all your excess information that did not make it to publication, most likely because of various constraints? Should everything, meaning every little tidbit, be readily available to the research community?

Schneider doesn’t think it should be. In his article, he writes that he hides some findings in a paper notebook or behind a password, and he keeps interviews and transcripts offline altogether to protect his sources.

Open-source

Open-source software communities tend to regard total transparency as inherently good. What are the advantages of total transparency? You may make connections between projects that you wouldn’t have otherwise. You can easily reproduce a peer’s experiment. You can even become more meticulous in your note-taking and experimental methods since you know it’s not private information. Similarly, journalists will recognize this thought pattern as the recent, popular call to engage in “open journalism.” Essentially, an author’s entire writing and editing process can be recorded, step by step.

TMI

This trend has led researchers to open-source programs like Jupyter and GitHub. Open-source programs detail every change that occurs along a project’s timeline. Is unorganized, excessive amounts of unpublishable data really what transparency means? Or does it confuse those looking for meaningful research that is meticulously curated?

The Big Idea

And what about the “vulnerability” claim? Sharing every edit and every new direction taken opens a scientist up to scoffers and harassment, even. Dark data in industry even involves publishing salaries, which can feel unfair to underrepresented, marginalized populations.

In Model View Culture, Ellen Marie Dash wrote: “Let’s give safety and consent the absolute highest priority, with openness and transparency prioritized explicitly below those. This means digging deep, properly articulating in detail what problems you are trying to solve with openness and transparency, and handling them individually or in smaller groups.”

------

This article originally appeared on the University of Houston's The Big Idea. Sarah Hill, the author of this piece, is the communications manager for the UH Division of Research.

Ad Placement 300x100
Ad Placement 300x600

CultureMap Emails are Awesome

Texas tops ranking of best state for investors in new report

by the numbers

Texas ranks third on a new list of the best states for investors and startups.

Investment platform BrokerChooser weighed five factors to come up with its ranking:

  • 2024 Google search volume for terms related to investing
  • Number of investors
  • Number of businesses receiving investments in 2024
  • Total amount of capital invested in businesses in 2024
  • Percentage change in amount of investment from 2019 to 2024

Based on those figures, provided mostly by Crunchbase, Texas sits at No. 3 on the list, behind No. 1 California and No. 2 New York.

Especially noteworthy for Texas is its investment total for 2024: more than $164.5 billion. From 2019 to 2024, the state saw a 440 percent jump in business investments, according to BrokerChooser. The same percentages are 204 percent for California and 396 percent for New York.

“There is definitely development and diversification in the American investment landscape, with impressive growth in areas that used to fly under the radar,” says Adam Nasli, head analyst at BrokerChooser.

According to Crunchbase, funding for Texas startups is off to a strong start in 2025. In the first three months of this year, venture capital investors poured nearly $2.9 billion into Lone Star State companies, Crunchbase data shows. Crunchbase attributes that healthy dollar amount to “enthusiasm around cybersecurity, defense tech, robotics, and de-extincting mammoths.”

During the first quarter of this year, roughly two-thirds of VC funding in Texas went to just five companies, says Crunchbase. Those companies are Austin-based Apptronik, Austin-based Colossal Biosciences, Dallas-based Island, Austin-based NinjaOne, and Austin-based Saronic.

Autonomous truck company rolls out driverless Houston-Dallas route

up and running

Houston is helping drive the evolution of self-driving freight trucks.

In October, Aurora opened a more than 90,000-square-foot terminal at a Fallbrook Drive logistics hub in northwest Houston to support the launch of its first “lane” for driverless trucks—a Houston-to-Dallas route on the Interstate 45 corridor. Aurora opened its Dallas-area terminal in April and the company began regular driverless customer deliveries between the two Texas cities on April 27.

Close to half of all truck freight in Texas moves along I-45 between Houston and Dallas.

“Now, we are the first company to successfully and safely operate a commercial driverless trucking service on public roads. Riding in the back seat for our inaugural trip was an honor of a lifetime – the Aurora Driver performed perfectly and it’s a moment I’ll never forget,” Chris Urmson, CEO and co-founder of Pittsburgh-based Aurora, said in a news release.

Aurora produces software that controls autonomous vehicles and is known for its flagship product, the Aurora Driver. The software is installed in Volvo and Paccar trucks, the latter of which includes brands like Kenworth and Peterbilt.

Aurora previously hauled more than 75 loads per week under the supervision of vehicle operators from Houston to Dallas and Fort Worth to El Paso for customers in its pilot project, including FedEx, Uber Freight and Werner. To date, it has completed over 1,200 miles without a driver.

The company launched its new Houston to Dallas route with customers Uber Freight and Hirschbach Motor Lines, which ran supervised commercial pilots with Aurora.

“Transforming an old school industry like trucking is never easy, but we can’t ignore the safety and efficiency benefits this technology can deliver. Autonomous trucks aren’t just going to help grow our business – they’re also going to give our drivers better lives by handling the lengthier and less desirable routes,” Richard Stocking, CEO of Hirschbach Motor Lines, added in the statement.

The company plans to expand its service to El Paso and Phoenix by the end of 2025.

“These new, autonomous semis on the I-45 corridor will efficiently move products, create jobs, and help make our roadways safer,” Gov. Greg Abbott added in the release. “Texas offers businesses the freedom to succeed, and the Aurora Driver will further spur economic growth and job creation in Texas. Together through innovation, we will build a stronger, more prosperous Texas for generations.”

In July, Aurora said it raised $820 million in capital to fuel its growth—growth that’s being accompanied by scrutiny.

In light of recent controversies surrounding self-driving vehicles, the International Brotherhood of Teamsters, whose union members include over-the-road truckers, recently sent a letter to Lt. Gov. Dan Patrick calling for a ban on autonomous vehicles in Texas.

“The Teamsters believe that a human operator is needed in every vehicle—and that goes beyond partisan politics,” the letter states. “State legislators have a solemn duty in this matter to keep dangerous autonomous vehicles off our streets and keep Texans safe. Autonomous vehicles are not ready for prime time, and we urge you to act before someone in our community gets killed.”

Houston cell therapy company launches second-phase clinical trial

fighting cancer

A Houston cell therapy company has dosed its first patient in a Phase 2 clinical trial. March Biosciences is testing the efficacy of MB-105, a CD5-targeted CAR-T cell therapy for patients with relapsed or refractory CD5-positive T-cell lymphoma.

Last year, InnovationMap reported that March Biosciences had closed its series A with a $28.4 million raise. Now, the company, co-founded by Sarah Hein, Max Mamonkin and Malcolm Brenner, is ready to enroll a total of 46 patients in its study of people with difficult-to-treat cancer.

The trial will be conducted at cancer centers around the United States, but the first dose took place locally, at The University of Texas MD Anderson Cancer Center. Dr. Swaminathan P. Iyer, a professor in the department of lymphoma/myeloma at MD Anderson, is leading the trial.

“This represents a significant milestone in advancing MB-105 as a potential treatment option for patients with T-cell lymphoma who currently face extremely limited therapeutic choices,” Hein, who serves as CEO, says. “CAR-T therapies have revolutionized the treatment of B-cell lymphomas and leukemias but have not successfully addressed the rarer T-cell lymphomas and leukemias. We are optimistic that this larger trial will further validate MB-105's potential to address the critical unmet needs of these patients and look forward to reporting our first clinical readouts.”

The Phase 1 trial showed promise for MB-105 in terms of both safety and efficacy. That means that potentially concerning side effects, including neurological events and cytokine release above grade 3, were not observed. Those results were published last year, noting lasting remissions.

In January 2025, MB-105 won an orphan drug designation from the FDA. That results in seven years of market exclusivity if the drug is approved, as well as development incentives along the way.

The trial is enrolling its single-arm, two-stage study on ClinicalTrials.gov. For patients with stubborn blood cancers, the drug is providing new hope.