Houston voices

Houston research: Why you need a data management plan

Every situation is unique and deserves a one-of-the-kind data management plan, not a one-size-fits-all solution. Graphic byMiguel Tovar/University of Houston

Why do you need a data management plan? It mitigates error, increases research integrity and allows your research to be replicated – despite the “replication crisis” that the research enterprise has been wrestling with for some time.

Error

There are many horror stories of researchers losing their data. You can just plain lose your laptop or an external hard drive. Sometimes they are confiscated if you are traveling to another country — and you may not get them back. Some errors are more nuanced. For instance, a COVID-19 repository of contact-traced individuals was missing 16,000 results because Excel can’t exceed 1 million lines per spreadsheet.

Do you think a hard drive is the best repository? Keep in mind that 20 percent of hard drives fail within the first four years. Some researchers merely email their data back and forth and feel like it is “secure” in their inbox.

The human and machine error margins are wide. Continually backing up your results, while good practice, can’t ensure that you won’t lose invaluable research material.

Repositories

According to Reid Boehm, Ph.D., Research Data Management Librarian at the University of Houston Libraries, your best bet is to utilize research data repositories. “The systems and the administrators are focused on file integrity and preservation actions to mitigate loss and they often employ specific metadata fields and documentation with the content,” Boehm says of the repositories. “They usually provide a digital object identifier or other unique ID for a persistent record and access point to these data. It’s just so much less time and worry.”

Integrity

Losing data or being hacked can challenge data integrity. Data breaches do not only compromise research integrity, they can also be extremely expensive! According to Security Intelligence, the global average cost of a data breach in a 2019 study was $3.92 million. That is a 1.5 percent increase from the previous year’s study.

Sample size — how large or small a study was — is another example of how data integrity can affect a study. Retraction Watch removes approximately 1,500 articles annually from prestigious journals for “sloppy science.” One of the main reasons the papers end up being retracted is that the sample size was too small to be a representative group.

Replication

Another metric for measuring data integrity is whether or not the experiment can be replicated. The ability to recreate an experiment is paramount to the scientific enterprise. In a Nature article entitled, 1,500 scientists lift the lid on reproducibility, “73 percent said that they think that at least half of the papers can be trusted, with physicists and chemists generally showing the most confidence.”

However, according to Kelsey Piper at Vox, “an attempt to replicate studies from top journals Nature and Science found that 13 of the 21 results looked at could be reproduced.”

That's so meta

The archivist Jason Scott said, “Metadata is a love note to the future.” Learning how to keep data about data is a critical part of reproducing an experiment.

“While this will be always be determined by a combination of project specifics and disciplinary considerations, descriptive metadata should include as much information about the process as possible,” said Boehm. Details of workflows, any standard operating procedures and parameters of measurement, clear definitions of variables, code and software specifications and versions, and many other signifiers ensure the data will be of use to colleagues in the future.

In other words, making data accessible, useable and reproducible is of the utmost importance. You make reproducing experiments that much easier if you are doing a good job of capturing metadata in a consistent way.

The Big Idea

A data management plan includes storage, curation, archiving and dissemination of research data. Your university’s digital librarian is an invaluable resource. They can answer other tricky questions as well: such as, who does data belong to? And, when a post-doctoral student in your lab leaves the institution, can s/he take their data with them? Every situation is unique and deserves a one-of-the-kind data management plan, not a one-size-fits-all solution.

------

This article originally appeared on the University of Houston's The Big Idea. Sarah Hill, the author of this piece, is the communications manager for the UH Division of Research.

Trending News

Building Houston

 
 

Three of Houston's mayoral candidates shared the stage at Tech Rodeo to talk about how they would lead the city toward greater success within the innovation space. Photo by Natalie Harms/InnovationMap

It's an election year in Houston, and one of the big topics on the minds of the candidates is how to continue the momentum of Houston's developing innovation ecosystem.

Houston Exponential put three of the declared candidates on the stage yesterday to ask them about their vision for Houston on the final day of Houston Tech Rodeo 2023. HX CEO Natara Branch moderated the discussion with Chris Hollins, Lee Kaplan, and Amanda K. Edwards. Each candidate addressed issues from diversity and equity, the energy transition, and more.

Missed the conversations? Here are a few overheard moments and highlights of the panel.

“It’s integral to our vision for the future of Houston that this is a place where small businesses, entrepreneurs, and creatives can thrive. We want to grow this economy to be one of the strongest economies in the United States — and we know that startups and small businesses are the powerhouse for that.”

— says Chris Hollins, who explains that he's a small business owner himself and also served as interim Harris County Clerk from June 2020 to November 2020, overseeing the 2020 United States presidential election in Harris County.

“Houston has an energy-centric community, and a lot of people who have money have gotten too comfortable investing in just oil and gas. … I understand how hard it is to run a business, and I understand (it) from representing entrepreneurs and investors.”

— says Lee Kaplan, a founding partner at law firm Smyser Kaplan & Veselka LLP.

“One of the things that’s important in a leader is making sure that they understand your issues, but most importantly that they can execute. That has been something that has been chief in concert in the way that I have served in public service, but of course the way that I’ve been a part of the startup economy.“

— says Amanda K. Edwards, who contributed to the establishment of the city’s tech and innovation task force as an at-large Houston City Council member. The task force resulted in the creation of HX Venture Fund and the Innovation District, she explains.

“When we think about cities that have done this really well — Silicon Valley, The Bay Area, Boston, Austin — what’s key in many of those cities is institutions around education. … We have to lean into Rice University and the University of Houston — making these centers for talent, excellence, and innovation so that we’re developing the thinkers, the engineers, the creators of the future, and then we’re giving your businesses a crop of new hires.”

— Hollins says responding to a question about Houston's challenges.

“The thing that I think is the most important for the city is to be rigorous with what we do. We’re not going to get around the fact that it’s hot and we have mosquitos. But we can sell the fact that we have a city that’s improving.”

— Kaplan says on Houston's progress.

“I don’t want to compete or lose to any city in America. When I think about Houston, I’m bullish. I know that we are the place that is home to innovation, and it’s about time that people know us as that."

— Edwards says, referencing how Houston is known nationally for its problems — she gives the example of Hurricane Harvey. “We have major challenges in our city, but we can innovate using our innovation economy to provide answers and solutions to them.”

“Energy has to be a part of our story. We are where we are today because we’re the energy capital of the world. And we know that the energy transition is happening, and if we don’t lean into that, our region stands to lose hundreds of thousands of jobs.”

— Hollins says on the types of emerging tech in Houston.

“You often hear it said that Houston is the most diverse city in the nation, but I pose this challenge: What good is it to be the most diverse if we’re not solving the challenges that diverse communities face? And that includes equity in tech. We have all of the raw ingredients here in the Houston community to make Houston the home of where tech and innovation is diverse and equitable.”

— Edwards says on Houston's diversity and the challenges the city faces.

Trending News