Let's talk about dark data — what it means and how to navigate it. Graphic by Miguel Tovar/University of Houston

Is it necessary to share ALL your data? Is transparency a good thing or does it make researchers “vulnerable,” as author Nathan Schneider suggests in the Chronicle of Higher Education article, “Why Researchers Shouldn’t Share All Their Data.”

Dark Data Defined

Dark data is defined as the universe of information an organization collects, processes and stores – oftentimes for compliance reasons. Dark data never makes it to the official publication part of the project. According to the Gartner Glossary, “storing and securing data typically incurs more expense (and sometimes greater risk) than value.”

This topic is reminiscent of the file drawer effect, a phenomenon which reflects the influence of the results of a study on whether or not the study is published. Negative results can be just as important as hypotheses that are proven.

Publication bias and the need to only publish positive research that supports the PI’s hypothesis, it can be argued, is not good science. According to an article in the Indian Journal of Anaesthesia, authors Priscilla Joys Nagarajan, et al., wrote: “It is speculated that every significant result in the published world has 19 non-significant counterparts in file drawers.” That’s one definition of dark data.

Total Transparency

But what to do with all your excess information that did not make it to publication, most likely because of various constraints? Should everything, meaning every little tidbit, be readily available to the research community?

Schneider doesn’t think it should be. In his article, he writes that he hides some findings in a paper notebook or behind a password, and he keeps interviews and transcripts offline altogether to protect his sources.

Open-source

Open-source software communities tend to regard total transparency as inherently good. What are the advantages of total transparency? You may make connections between projects that you wouldn’t have otherwise. You can easily reproduce a peer’s experiment. You can even become more meticulous in your note-taking and experimental methods since you know it’s not private information. Similarly, journalists will recognize this thought pattern as the recent, popular call to engage in “open journalism.” Essentially, an author’s entire writing and editing process can be recorded, step by step.

TMI

This trend has led researchers to open-source programs like Jupyter and GitHub. Open-source programs detail every change that occurs along a project’s timeline. Is unorganized, excessive amounts of unpublishable data really what transparency means? Or does it confuse those looking for meaningful research that is meticulously curated?

The Big Idea

And what about the “vulnerability” claim? Sharing every edit and every new direction taken opens a scientist up to scoffers and harassment, even. Dark data in industry even involves publishing salaries, which can feel unfair to underrepresented, marginalized populations.

In Model View Culture, Ellen Marie Dash wrote: “Let’s give safety and consent the absolute highest priority, with openness and transparency prioritized explicitly below those. This means digging deep, properly articulating in detail what problems you are trying to solve with openness and transparency, and handling them individually or in smaller groups.”

------

This article originally appeared on the University of Houston's The Big Idea. Sarah Hill, the author of this piece, is the communications manager for the UH Division of Research.

Ad Placement 300x100
Ad Placement 300x600

CultureMap Emails are Awesome

Rice University MBA programs rank among top 5 in prestigious annual report

top of class

Rice University’s Jones Graduate School of Business MBA programs have been ranked among the top five in the country again in The Princeton Review’s 2025 Best Business Schools rankings.

The university's MBA program in finance earned a No. 3 ranking, climbing up two spots from its 2024 ranking. Finance MBA programs at the University of Virginia's Darden Graduate School of Business and New York University's Leonard N. Stern School of Business were the only ones to outrank Rice, claiming No. 2 and No. 1 spots, respectively.

Rice's online MBA program was ranked No. 5, compared to No. 4 last year. Indiana University's Bloomington Kelley School of Business' online program claimed the top spot.

“These rankings reflect the commitment of our faculty and staff, the drive and talent of our students and the strong support of our alumni and partners,” Peter Rodriguez, dean of Rice Business, said in a news release. “They are exceptional honors but also reminders — not just of our top-tier programs and world-class faculty and students but of our broader impact on the future of business education.”

Rice also ranked at No. 6 for “greatest resources for minority students."

The Princeton Review’s 2025 business school rankings are based on data from surveys of administrators at 244 business schools as well as surveys of 22,800 students enrolled in the schools’ MBA programs during the previous three academic years.

"The schools that made our lists for 2025 share four characteristics that inform our criteria for designating them as 'best': excellent academics, robust experiential learning components, outstanding career services, and positive feedback about them from enrolled students we surveyed," Rob Franek, The Princeton Review's editor-in-chief, said in a press release. "No b-school is best overall or best for all students, but to all students considering earning an MBA, we highly recommend these b-schools and salute them for their impressive programs."

Rice's finance program has ranked in the top 10 for eight consecutive years, and its online MBA has ranked in the top five for four years.

Rice and the University of Houston also claimed top marks on the Princeton Review's entrepreneurship rankings. Rice ranks as No. 1 on the Top 50 Entrepreneurship: Grad list, and the University of Houston ranked No. 1 on Top 50 Entrepreneurship: Ugrad. Read more here.

Houston named ‘star’ metro for artificial intelligence in new report

eyes on AI

A new report declares Houston one of the country’s 28 “star” hubs for artificial intelligence.

The Houston metro area appears at No. 16 in the Brookings Institution’s ranking of metros that are AI “stars.” The metro areas earned star status based on data from three AI buckets: talent, innovation and adoption. Only two places, the San Francisco Bay Area and Silicon Valley, made Brookings’ “superstar” list.

According to Brookings, the Houston area had 11,369 job postings in 2024 that sought candidates with AI skills, 210 AI startups (based on Crunchbase data from 2014 to 2024), and 113 venture capital deals for AI startups (based on PitchBook data from 2023 to 2024).

A number of developments are boosting Houston’s AI profile, such as:

Brookings also named Texas’s three other major metros as AI stars:

  • No. 11 Austin
  • No. 13 Dallas-Fort Worth
  • No. 40 San Antonio

Brookings said star metros like Houston “are bridging the gap” between the two superstar regions and the rest of the country. In 2025, the 28 star metros made up 46 percent of the country’s metro-area employment but 54 percent of AI job postings. Across the 28 metros, the number of AI job postings soared 139 percent between 2018 and 2025, according to Brookings.

Around the country, dozens of metros fell into three other categories on Brookings’ AI list: “emerging centers” (14 metros), “focused movers” (29 metros) and “nascent adopters” (79 metros).