Let's talk about dark data — what it means and how to navigate it. Graphic by Miguel Tovar/University of Houston

Is it necessary to share ALL your data? Is transparency a good thing or does it make researchers “vulnerable,” as author Nathan Schneider suggests in the Chronicle of Higher Education article, “Why Researchers Shouldn’t Share All Their Data.”

Dark Data Defined

Dark data is defined as the universe of information an organization collects, processes and stores – oftentimes for compliance reasons. Dark data never makes it to the official publication part of the project. According to the Gartner Glossary, “storing and securing data typically incurs more expense (and sometimes greater risk) than value.”

This topic is reminiscent of the file drawer effect, a phenomenon which reflects the influence of the results of a study on whether or not the study is published. Negative results can be just as important as hypotheses that are proven.

Publication bias and the need to only publish positive research that supports the PI’s hypothesis, it can be argued, is not good science. According to an article in the Indian Journal of Anaesthesia, authors Priscilla Joys Nagarajan, et al., wrote: “It is speculated that every significant result in the published world has 19 non-significant counterparts in file drawers.” That’s one definition of dark data.

Total Transparency

But what to do with all your excess information that did not make it to publication, most likely because of various constraints? Should everything, meaning every little tidbit, be readily available to the research community?

Schneider doesn’t think it should be. In his article, he writes that he hides some findings in a paper notebook or behind a password, and he keeps interviews and transcripts offline altogether to protect his sources.

Open-source

Open-source software communities tend to regard total transparency as inherently good. What are the advantages of total transparency? You may make connections between projects that you wouldn’t have otherwise. You can easily reproduce a peer’s experiment. You can even become more meticulous in your note-taking and experimental methods since you know it’s not private information. Similarly, journalists will recognize this thought pattern as the recent, popular call to engage in “open journalism.” Essentially, an author’s entire writing and editing process can be recorded, step by step.

TMI

This trend has led researchers to open-source programs like Jupyter and GitHub. Open-source programs detail every change that occurs along a project’s timeline. Is unorganized, excessive amounts of unpublishable data really what transparency means? Or does it confuse those looking for meaningful research that is meticulously curated?

The Big Idea

And what about the “vulnerability” claim? Sharing every edit and every new direction taken opens a scientist up to scoffers and harassment, even. Dark data in industry even involves publishing salaries, which can feel unfair to underrepresented, marginalized populations.

In Model View Culture, Ellen Marie Dash wrote: “Let’s give safety and consent the absolute highest priority, with openness and transparency prioritized explicitly below those. This means digging deep, properly articulating in detail what problems you are trying to solve with openness and transparency, and handling them individually or in smaller groups.”

------

This article originally appeared on the University of Houston's The Big Idea. Sarah Hill, the author of this piece, is the communications manager for the UH Division of Research.

Ad Placement 300x100
Ad Placement 300x600

CultureMap Emails are Awesome

Microsoft partners with Rice University's OpenStax on AI teaching tool

group project

Rice University’s OpenStax and Microsoft are partnering to integrate the nonprofit’s content with the tech giant’s AI innovation, known as Learning Zone.

“At OpenStax, our mission is to make an amazing education accessible to all,” Richard G. Baraniuk, founder and director of OpenStax, said in a news release. “That’s why we’re excited to integrate our trustworthy, peer-reviewed content with Microsoft’s AI technology through the Microsoft Learning Zone. Together, we aim to help more instructors and their students access engaging, effective learning experiences in new and dynamic ways. We also share a strong commitment to the thoughtful and responsible application of AI to better ensure all learners can succeed.”

OpenStax is a provider of affordable instructional technologies and is also one of the world’s largest publishers of open educational resources (OER).

Microsoft Learning Zone promises to provide educators and students with “responsible AI technology and peer-reviewed educational content to support learning” on Microsoft Copilot+ PCs. Microsoft Learning Zone works by utilizing on-device AI to generate interactive lessons for students, and its integration with OpenStax content means educators can rely on OpenStax’s digital library of 80 openly licensed titles.

The goal is for educators to create effective and engaging learning experiences safely, thereby bypassing the need to source and vet content independently. Included is a library of ready-to-use lessons, opportunity for immediate feedback and differentiated learning. Educators will maintain control of instructional content and pedagogical strategies and will be able to update or edit lessons or activities prior to sharing them with students.

Other tools included in the Microsoft Learning Zone are additional languages, reading coaching, public speaking help, math and reading progress, and a partnership with the online quiz platform Kahoot!

OpenStax resources have been reported as used across 153 countries, and this current collaboration combines the power and potential of responsible AI usage in education with content that has been utilized by 13,569 K-12 schools and 71 percent of U.S. colleges and universities, according to Rice.

“Through our partnership with OpenStax, we’re combining the power of on-device AI in Copilot+ PCs with OpenStax’s trusted and diverse peer-reviewed content to help educators quickly create high-quality, personalized, engaging lessons,” Deirdre Quarnstrom, vice president of Microsoft Education, added in the news release. “We’re excited about how this collaboration will empower classrooms globally.”

Houston ranks among the 15 best cities in America for 2025

Ranking It

A new national report ranking the best cities in America has declared Houston is the 13th best U.S. city in 2025.

The annual "America's Best Cities" report from Resonance Consultancy, a Canada-based real estate and tourism marketing firm, ranks the relative qualities of livability, cultural "lovability," and economic prosperity in America's principal cities with metropolitan populations of 500,000 or more.

These top 100 cities "brilliantly weave local culture with economic prospects, aligning with global and domestic travel trends to create premier experiences for both residents and travelers."

Houston's new placement is three spots lower than its triumphant No. 10 rank in 2024. Though it no longer holds a prized top-10 spot, the report hails Houston as a "coveted hometown for the best and brightest on earth."

Among Houston's other shining attributes are its booming population growth – which has now grown to a staggering 7.51 million people – plus its competitive real estate market and a vibrant hospitality and culinary scene. Additionally, Houston earned a well-deserved No. 4 rank in the report's "restaurants" subcategory.

"A 221-room Home2 Suites/Tru by Hilton debuted near Toyota Center in March, while Marriott’s newly renovated Residence Inn NASA/Clear Lake keeps surging visitors happy," the report said. "West Houston’s Greenside will convert 35,000 square feet of warehouses into a park-laced retail hub by 2026, while America’s inaugural Ismaili Center, rising along Allen Parkway, remains on schedule for later this year, adding yet another cultural jewel to H-Town’s festival-fueled mosaic."

The report additionally puts a large focus on Houston's innovative energy sector and its numerous large companies that boost the local economy and create thousands of jobs. Several of these Houston-based companies are some of the best places to work, according to U.S. News and World Report.

One such company that was specifically praised in the report was the HyVelocity Hub, which aims to "ensure economic resilience and long-lasting, domestic energy production" along the Gulf Coast.

"The HyVelocity Hydrogen Hub just locked in up to $1.2 billion from the U.S. DOE, targeting 45,000 jobs and slicing 7.7 million tons of CO₂ a year," the report said.

Elsewhere in Texas

Dallas (No. 14) and Austin (No. 15) ranked right behind Houston in the top 20, while San Antonio sank into the No. 41 spot after previously ranking 32nd last year. El Paso also fell from No. 83 in 2024 to No. 100 this year.

Jason McGrath, the executive vice president and head of U.S. Corporate Reputation at Ipsos, said in the report that American cities are proving their resiliency as they are faced with "climate issues, increasing infrastructural demands, and shifting geopolitical landscapes affecting international relations and travel."

"Despite these hurdles, cities are creatively reinventing themselves to shine as examples of cultural vibrancy and innovation, keeping their competitive edge both locally and globally," McGrath said. "Our report shows that the allure of American cities remains strong, thanks to their rich diversity and vast attractions — from natural wonders to buzzing cultural hubs."

The top 10 best cities in America in 2025 are:

  • No. 1 – New York City, New York
  • No. 2 – Los Angeles, California
  • No. 3 – Chicago, Illinois
  • No. 4 – San Francisco, California
  • No. 5 – Seattle, Washington
  • No. 6 – Miami, Florida
  • No. 7 – Boston, Massachusetts
  • No. 8 – Washington, D.C.
  • No. 9 – Las Vegas, Nevada
  • No. 10 – San Diego, California
---

This article originally appeared on CultureMap.com.

Houston biosciences company opens new sequencing center for drug development

bioscience breakthroughs

Houston-based Avance Biosciences has launched the Next-Generation Sequencing Center of Excellence, designed to enhance the company’s sequencing capabilities for drug development. Specifically, the facility at the company’s main campus in Northwest Houston will pursue breakthroughs in biologics, cell therapy and gene therapy.

In the drug industry, sequencing refers to studying nucleotides in DNA and RNA molecules. Nucleotides are the building blocks of DNA and RNA.

“This is a major milestone for Avance Biosciences as we continue to support the evolving needs of biologics and cell and gene therapy developers,” Xuening “James” Huang, co-founder, CEO and chief technology officer of Avance, said in a news release. “By consolidating state-of-the-art sequencing platforms and scientific talent, we’ve created a highly capable organization ready to solve complex genomic challenges with precision and compliance.”

In 2013, Avance rolled out next-generation sequencing (NGS) that complies with federal guidelines. Since then, Avance “has remained at the forefront of regulated sequencing services,” the company said. “The launch of the (new center) strengthens the company’s ability to deliver accurate, reproducible, and regulatory-aligned sequencing data across a wide array of therapeutic modalities.”

Cal Froberg, senior vice president of sales and marketing at Avance, said pharmaceutical and biotech clients trust the company’s technical capabilities and regulatory compliance.

“With the ever-changing global landscape and increasing scrutiny around international sample shipments, conducting advanced, cost-effective NGS testing domestically is now more feasible than ever,” Froberg said. “Our clients have confidence that their samples will remain in the U.S.”

Avance, founded in 2010, plans to hold an open house at the new facility in September to showcase its capabilities, technology, talent, and services. The company’s services include sequencing, molecular biology, cell-based testing, and bioanalytical testing.