Let's talk about dark data — what it means and how to navigate it. Graphic by Miguel Tovar/University of Houston

Is it necessary to share ALL your data? Is transparency a good thing or does it make researchers “vulnerable,” as author Nathan Schneider suggests in the Chronicle of Higher Education article, “Why Researchers Shouldn’t Share All Their Data.”

Dark Data Defined

Dark data is defined as the universe of information an organization collects, processes and stores – oftentimes for compliance reasons. Dark data never makes it to the official publication part of the project. According to the Gartner Glossary, “storing and securing data typically incurs more expense (and sometimes greater risk) than value.”

This topic is reminiscent of the file drawer effect, a phenomenon which reflects the influence of the results of a study on whether or not the study is published. Negative results can be just as important as hypotheses that are proven.

Publication bias and the need to only publish positive research that supports the PI’s hypothesis, it can be argued, is not good science. According to an article in the Indian Journal of Anaesthesia, authors Priscilla Joys Nagarajan, et al., wrote: “It is speculated that every significant result in the published world has 19 non-significant counterparts in file drawers.” That’s one definition of dark data.

Total Transparency

But what to do with all your excess information that did not make it to publication, most likely because of various constraints? Should everything, meaning every little tidbit, be readily available to the research community?

Schneider doesn’t think it should be. In his article, he writes that he hides some findings in a paper notebook or behind a password, and he keeps interviews and transcripts offline altogether to protect his sources.

Open-source

Open-source software communities tend to regard total transparency as inherently good. What are the advantages of total transparency? You may make connections between projects that you wouldn’t have otherwise. You can easily reproduce a peer’s experiment. You can even become more meticulous in your note-taking and experimental methods since you know it’s not private information. Similarly, journalists will recognize this thought pattern as the recent, popular call to engage in “open journalism.” Essentially, an author’s entire writing and editing process can be recorded, step by step.

TMI

This trend has led researchers to open-source programs like Jupyter and GitHub. Open-source programs detail every change that occurs along a project’s timeline. Is unorganized, excessive amounts of unpublishable data really what transparency means? Or does it confuse those looking for meaningful research that is meticulously curated?

The Big Idea

And what about the “vulnerability” claim? Sharing every edit and every new direction taken opens a scientist up to scoffers and harassment, even. Dark data in industry even involves publishing salaries, which can feel unfair to underrepresented, marginalized populations.

In Model View Culture, Ellen Marie Dash wrote: “Let’s give safety and consent the absolute highest priority, with openness and transparency prioritized explicitly below those. This means digging deep, properly articulating in detail what problems you are trying to solve with openness and transparency, and handling them individually or in smaller groups.”

------

This article originally appeared on the University of Houston's The Big Idea. Sarah Hill, the author of this piece, is the communications manager for the UH Division of Research.

Ad Placement 300x100
Ad Placement 300x600

CultureMap Emails are Awesome

Pharma giant considers Houston for $1B manufacturing campus

in the works

Another pharmaceutical giant is considering Houston’s Generation Park for a manufacturing hub.

According to a recent filing with the Texas Jobs, Energy, Technology and Innovation (JETI) program, Bristol Myers Squibb Co. is considering the northeast Houston management district for a new $1 billion multi-modal pharmaceutical manufacturing campus.

If approved, the campus, known as Project Argonaut, could create 489 jobs in Texas by 2031. Jobs would include operations technicians, engineering roles, administrative and management roles, production specialists, maintenance support, and quality control/assurance. The company predicts annual average wages for these positions to be around $96,000, according to the filing.

The project currently includes the 600,000-square-foot facility, but according to the filing, Bristol Myers Squibb “envisions this site growing in scale and capability well beyond its opening configuration."

The Texas JETI program offers companies temporary school property tax limitations in exchange for major capital investment and job creation. E.R. Squibb & Sons LLC applied for a 10-year tax abatement agreement in the Sheldon Independent School District.

The agreement promises a $ 1 billion investment. Construction would begin in 2027 and wrap in 2029.

“The proposed project reflects [Bristol Myers Squibb Co.’s] enduring commitment to bringing innovative medicines to patients and ensuring the long-term supply reliability they depend on,” the filing says. “The proposed project is purpose-built to support and manufacture medicines spanning multiple therapeutic areas and modalities, positioning the site as a long-term launch and commercial campus for decades to come. These medicines will provide therapies to the [Bristol Myers Squibb Co.’s] patients located in markets both nationally and internationally.”

The Fortune 100 company is considering 16 other cities for the new manufacturing facility in the Central and Eastern markets in the U.S. According to the Houston Chronicle, Bristol Myers Squibb Co is still in the “evaluation process” for its potential manufacturing site.

Last fall, Eli Lilly and Co. selected Generation Park for its $6.5 billion manufacturing plant. More than 300 locations in the U.S. competed for the factory. Read more here.

Houston health tech co. lands NIH grant for AI cancer prediction tool

fresh funding

Houston-based CellChorus and Stanford Medicine were recently awarded a Phase I Small Business Innovation Research grant for the company's AI platform to test how certain cancer patients will respond to therapies.

The funding comes from the National Cancer Institute of the National Institutes of Health. According to a filing, the grant totaled just under $400,000.

CellChorus, which spun out from the University of Houston’s Technology Bridge, has developed TIMING (Time-lapse Imaging Microscopy In Nanowell Grids), which analyzes the behavior of thousands of individual immune cells over time and can identify early indicators of treatment success or failure.

The company will work with Stanford's Dr. David Miklos and Dr. Saurabh Dahiya, who have built the Bone Marrow Transplantation and Cell Therapy Biobank. The biobank manages and stores biological samples from patients treated at their clinic and in clinical trials.

"Predicting which patients will achieve durable responses after CAR-T therapy remains one of the most important challenges in the field,” Miklos said in a news release. “We aim to uncover functional cellular signatures that can guide treatment decisions and improve patient outcomes.”

The project will specifically profile cells from patients with relapsed/refractory large B-cell lymphoma (r/rLBCL). According to CellChorus, only about half of r/rLBCL patients who receive CAR-T therapy "achieve a durable, long-term remission." Others do not respond to therapy or experience relapse.

“The sooner we know whether a cancer therapy is working, the better. To maximize patient benefit, we need technology that can provide a robust and early prediction of response to therapy. The technology needs to be scalable, cost-efficient, and capable of rapid turnaround times,” Rebecca Berdeaux, chief scientific officer of CellChorus, added in the release. “We are excited to work with Drs. David Miklos and Saurabh Dahiya and their colleagues on this very important project.”

CellChorus has previously received SBIR grants from federal agencies, including a $2.5 million award in 2024 from its National Center for Advancing Translational Sciences (NCATS) and a $2.3 million SBIR Fast-Track award from the National Institute of General Medical Sciences in 2023.

Houston museum showcases America's founding documents in rare exhibit

Experience History

As the United States prepares to celebrate its 250th birthday, Houstonians have a chance to see rare documents from the founding of the nation. Freedom Plane National Tour: Documents That Forged a Nation, presented by the National Archives Foundation, will be on display at the Houston Museum of Natural Science through Monday, May 25.

The collection includes a rare engraving of the original Declaration of Independence; official Oaths of Allegiance signed by George Washington, Aaron Burr, and Alexander Hamilton; a draft of the Bill of Rights; the Treaty of Paris, the documented that recognized America's independence from Great Britain; and the tally of votes approving the Constitution.

The National Archives specifically chose Houston as one of only eight cities in the country to host the exhibit as a means to help the documents reach a wider audience outside of the main hub of semiquincentennial events in New England and the Washington, D.C. area.

"One of the things we decided when we put the tour together because we wanted to be off the East Coast," said Patrick Madden, CEO of the National Archives Foundation, who was onsite for the exhibit's opening in Houston. "There's a lot of 250th celebration stuff happening in the original 13 colonies. How do we get it to major markets where larger numbers of people can see it? So in the case of Houston, obviously, [is a] major market in this part of the country, but also we've partnered with the museum twice before with National Archives exhibits, so we knew that they would be up to the task of handling the exhibit and the crowds."

The star of the collection is a rare engraving of the original Declaration of Independence. Secretary of State and future president John Quincy Adams commissioned 200 exact replicas of the document from engraver William J. Stone in 1823. Less than 50 now remain. Madden joyfully pointed out that there are errors in this document, a potent reminder that the men who forged a nation made mistakes.

"There's a couple of typos in it where they had to make corrections," said Madden. "So even the founders, you know, they're all human. That resonates because here these people are making this move against the most powerful nation in the world and putting their lives on the line for a country based on ideas."

Other impressive parts of the collection include official Oaths of Allegiance signed by George Washington, Aaron Burr, and Alexander Hamilton, as well as one of the drafts of the Bill of Rights. Many states would not ratify the Constitution until certain rights were included in the document, leading to Washington going on a national tour assuring state leaders enshrining protections was first on the list. The draft copy on display specifically shows the First Amendment in progress.

Houston is the fourth stop on the exhibition's tour, which will take the documents to Denver, Miami, Dearborn, and Seattle through the summer. Freedom Plane is just one part of a larger patriotic celebration at the HMNS, which includes a film series celebrating American science and culture and general Americana decoration throughout the main hall.

Admission to Freedom Plane is free to the public, but separate from general admission to the museum. Space is limited, and passes are available on a first-come, first-serve basis. Non-members should expect long waits or the possibility that the day's passes are sold out. Only museum members can reserve passes for specific times. Flash photography is prohibited due to the fragile nature of the documents.

---

This article originally appeared on CultureMap.com.