Skip to main content

Data Management, Analysis & Visualization Research Area

College of Information Science faculty are leaders in the data management, analysis and visualization research area, including 3D data and virtual reality, bibliometrics, cybersecurity, data curation, data curation and preservation, data engineering, data visualization, high-performance computing applications, information organization, metadata, and parallel and distributed systems.


Faculty

Image
Cristian Román-Palacios

Cristian Román

Assistant Professor
Coordinator and Advisor for MSDS and MSIS
  • Phylogenetics and biodiversity drivers
  • Paleoclimatic reconstructions and climate change
  • Applied data mining and machine learning

Select Current & Recent Research

Current and recent funded faculty research in this area includes but is not limited to the following projects:

Open and FAIR Samples: Maturing the Sample Data Ecosystem
PI: Natalie Raia (The University of Arizona)
Co-PI: Andrea Thomer (The University of Arizona)
Funding: National Science Foundation, $599,462
Project Dates: January 1, 2026 – December 31, 2028 (estimated)
Summary:
This project advances FAIR principles in the Earth and space science sample data ecosystem by promoting widespread adoption of persistent identifier (PID) infrastructure, ensuring that samples are uniquely identified, persistently linked to research outputs and properly cited in the scholarly record. Through analysis, coordination, education and community engagement across researchers, sample repositories, data repositories and publishers, the project aims to build a more integrated and sustainable open science infrastructure for sample-based research.


FAIROS RCN: Ethical Open Science for Past Global Change Data
PI: Andrea Thomer (The University of Arizona)
Funding: National Science Foundation, $246,097
Project Dates: January 1, 2023 – December 31, 2027 (estimated)
Website: eos-rcn.github.io/web/home
Summary:
In this Research Coordination Network (RCN), researchers aim to build technical and social capacity among community-curated data repositories in the quaternary sciences (e.g., paleoecology, archaeology, paleobiology). They are doing this by supporting technical implementation of ethical open science principles and developing communities of practice focused on CARE and FAIR principles.


Internet of Samples (iSamples): Toward an Interdisciplinary Cyberinfrastructure for Material Samples
PI: Andrea Thomer (The University of Arizona)
Funding: National Science Foundation, $1,072,351.00
Project Dates: August 1, 2020 – July 31, 2026
Website: isamples.org
Summary:
The Internet of Samples (iSamples) is a multi-disciplinary and multi-institutional project to design, develop and promote service infrastructure to uniquely, consistently and conveniently identify material samples, record metadata about them and persistently link them to other samples and derived digital content, including images, data and publications. 


Next Generation Interoperable Data Infrastructure for Geoscience Sample Data (EarthChem, LEPR/traceDs, SESAR): IEDA Re-invented
PI: Kerstin Lehnert (Columbia University) 
Co-PIs: Andrea Thomer (The University of Arizona), Peng Ji (Columbia University)
Funding: Columbia University, National Science Foundation, $257,591
Project Dates: August 15, 2022 – July 31, 2027 (estimated)
Website: www.geosamples.org
Summary:
The Interdisciplinary Earth Data Alliance is a collaborative data infrastructure of three complementary data systems—EarthChem, LEPR/traceDs and SESAR—that jointly support researchers in the geosciences to share and access sample data following the FAIR data principles. In this grant, Andrea Thomer is funded as the product manager of SESAR, and is guiding the transition to a new, independent, multi-disciplinary infrastructure for sample registration.


Preserving Research Data from Forest Service Experimental Areas in Arizona, Phase II
PI: Bryan Heidorn (The University of Arizona)
Funding: U.S. Forest Service, $140,000
Project Dates: June 12, 2023 – May 12, 2028
Summary:
The University of Arizona is collaborating with the National Forest System to preserve and make accessible decades of historical research data from four Forest Service Experimental Areas in Flagstaff, Arizona. These areas, which include Fort Valley, Long Valley, Sierra Ancha and Beaver Creek, house valuable data in both paper and digital formats. The project aims to digitize and publish this data through the FS Research Data Archive, ensuring it remains accessible and usable for future scientific, educational and land management purposes. Additionally, the project will create digital versions of paper records for ongoing use while preserving the originals for long-term stewardship by the National Archives and Records Administration (NARA). The collaboration will involve a data librarian working closely with the Forest Service’s Research Data Services and chief data officer to guide the project, which has the potential to impact data management across the entire National Forest System and beyond. InfoSci graduate and undergraduate students have worked on this project. Some of the students have traveled to the research areas to collect documents for reorganization, preservation and digitization.


Proof of Concept for a Forest Service-wide Data Catalog
PI: Bryan Heidorn (The University of Arizona)
Staff Research Lead: JosephAveril Cate, Jr. (The University of Arizona)
Funding: U.S. Forest Service, $400,000
Project Dates: September 4, 2023 – November 3, 2026
Summary:
The project is national in scope, covering data from districts, forests, regions and the entire National Forest System, particularly wildfire data. The project involves a data librarian working with USFS Research Data Services and the chief data officer to integrate data management across multiple programs. It aims to establish a proof of concept that impacts a wide range of Forest Service data creators and users, with guidance from the USDA Axon Data Governance platform to ensure broad and effective data use. The project aims to create a comprehensive data inventory for the Forest Service to meet Evidence Act requirements, including developing a searchable data catalog with metadata. It targets datasets related to the Wildfire Crisis Strategy and seeks to integrate the selected data catalog platform with the USDA's enterprise data catalog and governance platform, while also marketing published datasets through federal platforms like data.gov and GeoPlatform.