
Publication & Scholarly activity | Source Code


Greenberg, J., Boveda-Aguirre, P., …, McClellan, S., Tadmor, E. (2025). Towards MatCore: A Unified Metadata Standard for Materials Science. In: Sfakakis, M., Garoufallou, E., Damigos, M., Salaba, A., Papatheodorou, C. (eds) Metadata and Semantic Research. MTSR 2024. Communications in Computer and Information Science, vol 2331. Springer, Cham. [Pre-Print]

Polson, K., Potapova, M., Meena, U., Peiper, C., Brown, J., Agar, J., & Greenberg, J. (2025). Making Sense of Metadata Mess: Alignment and Risk Assessment for Diatom Data Use Case. In: Sfakakis, M., Garoufallou, E., Damigos, M., Salaba, A., Papatheodorou, C. (eds) Metadata and Semantic Research. MTSR 2024. Communications in Computer and Information Science, vol 2331. Springer, Cham. Links:[Official Journal][arXiv Pre-Print][Slides]


An, Y., Kolanupaka, S., An, J., Ma, M., Chhatwal, U., Kalinowski, A., Rogers, M., & Smith, B. (2024). Is the Lecture Engaging? Lecture Sentiment Analysis for Knowledge Graph-Supported Intelligent Lecturing Assistant (ILA) System. IEEE International Conference on Big Data (Print), 3358–3366.

Greenberg, J., Polson, K., McClellan, S., Zhao, X., Kalinowski, A., & An, Y. (2024, September 17). Enhancing semantic interoperability across materials science with HIVE4MAT. The 1st International Workshop on Semantic Materials Science (SeMatS), Amsterdam, The Netherlands. Links:[Official Journal][arXiv Pre-Print][Slides]

Pepper, J., Jones, E., Zhao, X., Furst, J., Langlois, K., Uribe-Romo, F., Breen, D., Greenberg, J. (2024). AI-Ready Data: Knowledge Extraction from Archival Lab Notebooks. IEEE International Conference on Big Data, pages 2489–2495. [Slides][Presentation]

Polson, K., Greenberg, J., and McClellan, S. (2024). Aligning Keywords from Long Form Prose to Controlled Vocabulary. Presentation, Code4Lib, Ann Arbor, Michigan, May 14-15, 2024. [Slides][Presentation]

Balk, M., Bradley, J., Maruf, M., Altintaş, B., Bakis, Y., Bart Jr, H., Breen, D., Florian, C., Greenberg, J., Karpatne, A., Karnani, K., Mabee, P., Pepper, J., Jebbia, D., Tabarin, T., Wang, X., Lapp, H. (2024). A FAIR and modular image-based workflow for knowledge discovery in the emerging field of imageomics. Methods in Ecology and Evolution.

McClellan, S., de Oliveira, I. M., Rauch, C., Adriaenssens, S., & Greenberg, J. (2024). Exploratory analysis of a crowdsourcing metadata tool for building terminological consensus in civil engineering. Automation in Construction, 166, 105627.

Zhao, X., Langlois, K., Furst, J., An, Y., Hu, X., Gualdron, D. G., Uribe-Romo, F., & Greenberg, J. (2024). Research evolution of metal organic frameworks: A scientometric approach with human-in-the-loop. Journal of Data and Information Science (Warsaw, Poland), 9(3), 44–64.

Breen, D. (2024). Image Informatics for Metadata Extraction and Verification of Museum Specimen Images. Advances in Digital Media Workshop Series. June 2024. [Slides]

McClellan, S., An, Y., Zhao, X., Lin, X., & Greenberg, J. (2024). Characterizing Semantic Ambiguity of the Materials Science Ontologies. In Knowledge Organization for Resilience in Times of Crisis: Challenges and Opportunities (1st ed., pp. 129–144). Ergon – ein Verlag in der Nomos Verlagsgesellschaft.


Breen, D., Senin, A., Levere, A., Pepper, J., Greenberg, J. (2023). Specimen Outlining: A Computational Archival Science Approach. IEEE International Conference on Big Data, pages 2004-2009.

Polson, K., McClellan, S., and Greenberg, J. (2023). Advancing HIVE-4-MAT’s Capacity to Leverage Multiple Vocabularies. 2023 Vocabulary Symposium, Canberra, ANU, November 14-15, 2023. [Slides][Presentation]

Zhao, X., Langlois, K., Furst, J., McClellan, S., Fleur, R., An, Y., … & Greenberg, J. (2023, December). When LLM Meets Material Science: An Investigation on MOF Synthesis Labeling. In 2023 IEEE International Conference on Big Data (BigData) (pp. 6320-6321). IEEE Computer Society. doi: 10.1109/BigData59044.2023.10386438. [Short paper/poster presentation]

Fleur, R., Addy Ireland, A., Zhao, X., McClellan, S., Paltoo, E., Su, T., Lee, C., An, Y., Hu, X., Ertekin, E., Greenberg, J. (2023, December). “Investigating Data Reusability in Density Functional Theory Studies,” 2023 IEEE International Conference on Big Data (BigData), Sorrento, Italy, 2023, pp. 6143-6144, doi: 10.1109/BigData59044.2023.10386967. [Short paper/poster presentation]

Zhao, X., Langlois, K., Furst, J., McClellan, S., Hu, X., An, Y., … & Greenberg, J. (2023). Metadata for Scientific Experiment Reporting: A Case Study in Metal-Organic Frameworks. arXiv preprint arXiv:2310.12417.

An, Y., Greenberg, J., Kalinowski, A., Zhao, X., Hu, X., Uribe-Romo, F. J., & Gómez-Gualdrón, D. A. (2023). Knowledge Graph Question Answering for Materials Science (KGQA4MAT): Developing Natural Language Interface for Metal-Organic Frameworks Knowledge Graph (MOF-KG). arXiv preprint arXiv:2309.11361.

Jebbia, D., Wang, X., Bakis, Y., Bart Jr., H.L., Greenberg, J. (2023). Toward a Flexible Metadata Pipeline for Fish Specimen Images. In: Garoufallou, E., Vlachidis, A. (eds) Metadata and Semantic Research, pp 175-190. MTSR 2022. Communications in Computer and Information Science, vol 1789. Springer, Cham.

Greenberg, J., McClellan, S., Zhao, X., Kellner, E., Venator, D., Zhao, H., Shen, J., Hu, X., & An, Y. (2023). Materials Science Ontology Design with an Analytical-Synthetico Facet Analysis Framework. In: Garoufallou, E., Vlachidis, A. (eds) Metadata and Semantic Research, pp 211–221. MTSR 2022. Communications in Computer and Information Science, vol 1789. Springer, Cham.

McClellan, S., An, Y., Greenberg, J., Zhao, X. (2023). Along the Border: Term Overlap Among 5 Matportal Ontologies. 2023 Research Data Alliance Plenary Meeting. [Slides_RDA]

Zhao, X., McClellan, S., An, Y., Greenberg, J. (2023). Extracting Metal-Organic Framework Knowledge from Scholarly Big Data. 2023 Research Data Alliance Plenary Meeting. [Poster]

Greenberg, J., McClellan, S., Rauch, C., Zhao, X., Kelly, M., An, Y., Kunze, J., Orenstein, R., Porter, C., Meschke, V., & Toberer, E. (2023). Building Community Consensus for Scientific Metadata with YAMZ. Data Intelligence, 5(1), 242–260.


Elhamod, M., Diamond, K. M., Maga, A. M., Bakis, Y., Bart Jr, H. L., Mabee, P., … & Karpatne, A. (2022). Hierarchy‐guided neural network for species classification. Methods in Ecology and Evolution, 13(3), 642-652.

An, Y., Greenberg, J., Hu, X., Kalinowski, A., Fang, X., Zhao, X., … & Daniel, R. (2022, December). Exploring Pre-Trained Language Models to Build Knowledge Graph for Metal-Organic Frameworks (MOFs). In 2022 IEEE International Conference on Big Data (Big Data) (pp. 3651-3658). IEEE.

Razzaghi, H., Kahn, M., Lehmann, M., Sciolla, J., Marchesani, , Lyman, K., Greenberg, J., and Bailey, J. (Under review, 2022). Semantic Data Quality Assessment: An Investigation of Fitness for Use in Large Clinical Datasets. American Medical Informatics Association (AMIA), 2023.

Karnani, K., Pepper, J., Bakiş, Y., Wang, X., Bart Jr, H., Breen, D. E., & Greenberg, J. (2022). Computational metadata generation methods for biological specimen image collections. International Journal on Digital Libraries, 1-18.

Greenberg, J., & Marciano, R. (2022). Innovating Data Science Education and Computational Thinking: Connecting iSchools and LAMs. 2022 Digital Library Federation Forum. [Slides]

Venator, D., Kellner, E., Zhao, X., Greenberg, J. (2022). Automated identification of metal-organic framework synthesis information. Smart Manufacturing REU (SMREU)/Institute for Data Driven Dynamical Design (ID4) Poster Session [Poster]

Kellner, E., Venator, D., Shen, C., Zhao, H., Zhao, X., McClellan, S., Greenberg, J. (2022). Exploring faceted ontologies for the indexing of materials science literature. Smart Manufacturing REU (SMREU)/Institute for Data Driven Dynamical Design (ID4) Poster Session [Poster]

An, Y., Greenberg, J., Zhao, X., Hu, X., McClellan, S., Kalinowski, A., … & Ardila, K. (2022). Building open knowledge graph for metal-organic frameworks (MOF-KG): Challenges and Case Studies. arXiv preprint arXiv:2207.04502.

Grabus, S., Logan, P. M., & Greenberg, J. (2022). Temporal Concept Drift and Alignment: An Empirical Approach to Comparing Knowledge Organization Systems Over Time. Knowledge Organization, 49(2), 69-78.


Grabus, S., & Greenberg, J. (2021, December). Computational curation and the application of large-scale vocabularies. In 2021 IEEE International Conference on Big Data (Big Data) (pp. 2220-2223). IEEE. 10.1109/BigData52589.2021.9671611. [Slides] [Recording]

Greenberg, J., Zhao, X., Monselise, M., Grabus, S., & Boone, J. (2021). Knowledge organization systems: A network for AI with helping interdisciplinary vocabulary engineering. Cataloging & Classification Quarterly, 1-20.

Zhao, X., Greenberg, J., McClellan, S., Hu, Y. J., Lopez, S., Saikin, S. K., … & An, Y. (2021, December). Knowledge graph-empowered materials discovery. In 2021 IEEE International Conference on Big Data (Big Data) (pp. 4628-4632). IEEE. 10.1109/BigData52589.2021.9671503 [Slides]

Zhao, X., Greenberg, J., An, Y., & Hu, X. T. (2021, December). Fine-tuning BERT model for materials named entity recognition. In 2021 IEEE International Conference on Big Data (Big Data) (pp. 3717-3720). IEEE. 10.1109/BigData52589.2021.9671697 [Slides]

Leipzig, J., Nüst, D., Hoyt, C. T., Ram, K., & Greenberg, J. (2021). The role of metadata in reproducible computational research. Patterns, 2(9).  [Paper] (SLIDES reporting on the paper to IEEE P2957 Big Data Governance and Metadata Management Working Group (BDGMM-WG))

Rauch, C. B., Kelly, M., Kunze, J. A., & Greenberg, J. (2022). FAIR metadata: A community-driven vocabulary application. In E. Garoufallou, M.-A. Ovalle-Perandones, & A. Vlachidis (Eds.), Metadata and Semantic Research (pp. 187–198). Springer International Publishing.

Shpilker, P., Freeman, J., McKelvie, H., Ashey, J., Fonticella, J. M., Putnam, H., Greenberg, J., Cowen L. J., Couch, A., Daniels, N.M. (In Press, 2021). MEtaData format for open reef data (MEDFORD), In MTSR 2021: Research Conference on Metadata and Semantics Research. Springer, Cham.

An, Y., Kalinowski, A., & Greenberg, J. (2021, November). Clustering and network analysis for the embedding spaces of sentences and sub-sentences. In 2021 Second International Conference on Intelligent Data Science Technologies and Applications (IDSTA) (pp. 138-145). IEEE.

Greenberg, J. (2021). Metadata, ontologies, and the interoperability continuum. Research Data Alliance 18th Plenary Meeting. [Slides]

Pepper, J., Greenberg, J., Bakiş, Y., Wang, X., Bart, H., & Breen, D. (2021, September). Automatic metadata generation for fish specimen image collections. In 2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL) (pp. 31-40). IEEE. [Best Student-led Research Paper Award] 10.1109/JCDL52503.2021.00015

Zhao, X., Lopez, S., Saikin, S., Hu, X., & Greenberg, J. (2021). Text to insight: Accelerating organic materials knowledge extraction via deep learning. Proceedings of the Association for Information Science and Technology, 58(1), 558-562.

Huvila, I., Greenberg, J., Sköld, O., Thomer, A., Trace, C., & Zhao, X. (2021). Documenting information processes and practices: Paradata, provenance metadata, life‐cycles and pipelines. Proceedings of the Association for Information Science and Technology, 58(1), 604-609.

Greenberg, J., Rauch, C. B., & Kelly, M. (2021). Project pipeline: Preservation, persistence, and performance. 17th International Conference on Digital Preservation (iPRES), Beijing, China. [Preprint] [Video Recording] [Slides]

Zhao, X., Greenberg, J., Meschke, V., Toberer, E., & Hu, X. (2021). An exploratory analysis: Extracting materials science knowledge from unstructured scholarly data. The Electronic Library.

Elhamod, M., Diamond, K. M., Maga, A. M., Bakis, Y., Bart, H. L., Mabee, P., Dahdul, W., Leipzig, J., Greenberg, J., Avants, B., & Karpatne, A. (2021). Hierarchy-guided neural networks for species classification. bioRxiv. [In press: Methods in Ecology and Evolution]

Razzaghi, H., Greenberg, J., & Bailey, L. C. (2021). Developing a systematic approach to assessing data quality in secondary use of clinical data based on intended use (p. e10264).

Greene, M., Grabus, S., Greenberg, J. (2021). DARSI: An ontology for facilitating the development of data sharing and use agreements. In Proceedings from North American Symposium on Knowledge Organization. 8, 1-7. [Paper] [ArXiv] [Slides]

McClellan, S., Kelly, M., & Greenberg, J. (2021). Modeling Ephraim Chambers’ knowledge structure from a naïve standpoint. In Proceedings from North American Symposium on Knowledge Organization, 8, 1-9. [Paper] [Slides]

Kelly, M., Greenberg, J., Rauch, C. B., Grabus, S., Boone, J. P., Kunze, J., & Logan, P. (Pidapalooza 2021). Of arks and ontologies. [Presentation]

Greenberg J., Zhao X., Adair J., Boone J., Hu X.T. (2020) HIVE-4-MAT: Advancing the ontology infrastructure for materials science. In: Garoufallou E., Ovalle-Perandones MA. (eds) Metadata and Semantic Research. MTSR 2020. Communications in Computer and Information Science, vol 1355. Springer, Cham.

Kelly, M., Rauch, C., Greenberg, J., Grabus, S., Boone, J., Kunze, J., and Logan, P. M. (2021). Advancing ARKs in the historical ontology space. Code{4}lib Journal. Issue 50, 2021-02-10.

Breen, D., Pepper, J., & Greenberg, J. (2021, January 21-22). Approaches for computing specimen image research data. International Conference on Statistical Tools & Techniques and Research Data Analysis (ICSTTRDA 2021), Central University of Gujarat, India. [Abstract] [Slides]

Bhatt, Jay. (January 2021). Information awareness of research data in science and engineering. Virtual International Conference on Statistical Tools and Techniques for Research Data Analysis (ICSTTRDA 2021), the School of Library and Information Science, Central University of Gujarat, Gandhinagar, India.


Leipzig, J., Bakis, Y., Wang, X., Elhamod, M., Diamond, K., Dahdul, W., … & Greenberg, J. (2020, December). Biodiversity image quality metadata augments convolutional neural network classification of fish species. In Research Conference on Metadata and Semantics Research (pp. 3-12). Cham: Springer International Publishing.

Leipzig, J., Bakis, Y., Wang, X., Mohannad, E., Bart Jr., H. L., Greenberg, J. (2020). Supplemental material for: Biodiversity Image Quality Metadata Augments Convolutional Neural Network Classification of Fish Species. MTSR 2020 Proceedings. Figshare.

Kelly, M., Greenberg, J., Rauch, C., Grabus, S., Boone, J., Kunze, J., & Logan, P. (2020). A Computational Approach to Historical Ontologies. 2020 IEEE International Conference on Big Data (Big Data), 1878–1883.  10.1109/BigData50022.2020.9378268

Greenberg, J., Meschke, V., Toberer, E., Cox, J., Lopez, S., Saikin, S., Chang, R., & Garnett, R. (2020, September 9-10). Assembling Ontologies for the Discovery of New Materials. NKOS Consolidated Workshop, Virtual. [Slides]

Zhao, X., Greenberg, J., Hu, X., Meschke, V., & Toberer, E. (2020, August 1-5). Scholarly Big Data: Computational Approaches to Semantic Labeling in Materials Science. ACM/IEEE Joint Conference on Digital Libraries, Wuhan, Hubei, P. R. China. [Paper] [Slides]

Leipzig, J., Nüst, D., Hoyt, C.T., Soiland-Reyes, S., Ram, K., & Greenberg, J. (2020). The Role of metadata in reproducible computational research. 1-59.

Grabus, S. (2020). Evaluating the Impact of the Long-S upon 18th-Century Encyclopedia Britannica Automatic Subject Metadata Generation Results. Information Technology and Libraries, 39(3).
*Winner of 2020 LITA/Ex Libris Student Writing Award [ALA Press Release]

Greenberg, J., Grabus, S., Ke, W., Song, I., Williams, J., & Yang, E. (2020). LEADS-4-NDP Forum Report. [Report]


An, Y., Chen, S., Locantore, N., Allinder, M., Mohan, D., & Bowler, R. (2019). The Utility of shapelets for analyzing physical activity of COPD patients and non-COPD controls. IEEE International Conference on Bioinformatics and Biomedicine (BIBM 2019), Hard Rock Hotel, San Diego, CA, November, 2019. 10.1109/BIBM47256.2019.8983222

Grabus, S. & Pascua, S. (2019). Conference review: The 7th North American symposium on knowledge organization (NASKO 2019). Cataloging & Classification Quarterly, 57(6).

Logan, P. M., Greenberg, J., & Grabus, S. (2019). Knowledge Representation: Old, New, and Automated Indexing. In proceedings of Digital Humanities Conference 2019, Utrecht, The Netherlands. [Abstract]

Grabus, S. & Greenberg, J. (2019). The Landscape of Rights and Licensing Initiatives for Data Sharing. Data Science Journal, 18: 29, pp. 1–11.  

Grabus, S. & Greenberg, J. (2019). Expanded directory: Standards, Tools, and Community Initiatives. Data Science Journal, 18: 29.

Li, K., Greenberg, J., & Dunic, J. (2019). Data objects and documenting scientific processes: An analysis of data events in biodiversity data papers. Journal of the Association for Information Science and Technology.

Grabus, S., Greenberg, J., Logan, P., & Boone, J. (2019). Representing aboutness: Automatically indexing 19th-Century Encyclopedia Britannica entries. NASKO, Vol. 7. pp. 138-148. [Paper] [Slides].

Greenberg, J., Grabus, S., Johs, A., Doran, W., & Adair, J. (2019). Finding FAIR in DataSAR: A Repository of Data Sharing Agreements. Drexel-CODATA FAIR-RRDM Workshop 2019, Philadelphia, Pennsylvania, March 31, 2019. [Paper]

Leipzig, J., & Pascua, S. (2019). A FAIR Trade for Consents: Steps toward Automated Conversion of Genomic Sequencing Consents to Machine-readable Ontologies. Drexel-CODATA FAIR-RRDM Workshop 2019, Philadelphia, Pennsylvania, March 31, 2019. [Paper].

Poole, A. H., & Garwood, D. A. (2019). Digging into data management in public‐funded, international research in digital humanities. Journal of the Association for Information Science and Technology, 71(1), 84-97.

Garwood, D. A., & Poole, A. H. (2019). Pedagogy and public-funded research: an exploratory study of skills in digital humanities projects. Journal of Documentation, 75(3), 550-576.


Greenberg, J. (2018).  Metadata and Toward FAIR Sharing. Shareable Data: Metadata, Issues of Privacy, and Legal Implications. RDA 12th Plenary/International Data Week 2018, Gaborone, Botswana, November 6, 2018. [Slides]

Greenberg, J. (2018). Data Sharing Spoke Initiative. Northeast Big Data Innovation Hub (NEBDIH) 2018 Summit, Columbia, New York, April, 2018. [Slides]

Grabus, S. and Greenberg, J. (2018). Resources for understanding the data sharing landscape: Rights, licensing, and related initiatives. Poster presented at the Research Data Alliance 11th Plenary Meeting. Berlin, Germany. [Poster]

Poole, A. H., & Garwood, D. A. (2018). Interdisciplinary scholarly collaboration in data-intensive, public-funded, international digital humanities project work. Library & Information Science Research, 40(3-4), 184-193.

Poole, A. H., & Garwood, D. A. (2018). Natural allies: Librarians, archivists, and big data in international digital humanities project work. Journal of Documentation, 74(4), 804-826.

Greenberg, J., Grabus, S. (2018). Metadata Solutions and Data Sharing Licensing for Big Data. IEEE Big Data Governance and Metadata Management Workshop, Technical University, Berlin, Germany, March, 2018. [Slides]

Greenberg, J., Poole, A., Grabus, S., Boone, J., Chilutti, M., Lamm, S., & Pennington, J. (2018). Supporting Data Analytics through Data Dictionaries at Children’s Hospital of Philadelphia (CHOP). Presented at the ASIST workshop on Big Metadata Analytics: Setting a Research Agenda for Data-Intensive Future. Vancouver, Canada, November 14, 2018. [Conference Presentation]

Ortiz-Repiso, V., Greenberg, J., & Calzada-Prado, J. (2018). A cross-institutional analysis of data-related curricula in information science programmes: A focused look at the iSchools. Journal of Information Science, 16555151774814. [Early Draft] [Link]

Li, K., & Yan, E. (2018). Co-mention network of R packages: Scientific impact and clustering structure. Journal of Informetrics, 12(1), 87–100. [Paper]

Tosaka, Y., & Park, J. (2018). Continuing Education in New Standards and Technologies for the Organization of Data and Information: A Report on the Cataloging and Metadata Professional Development Survey. Library Resources & Technical Services, 62(1), 4-15.


Storey, V. C., & Song, I.-Y. (2017). Big data technologies and Management: What conceptual modeling can do. Data & Knowledge Engineering, 108, 50-67.
*Winner of 2017 DKE Best Paper Award [Elsevier Press Release]

Leipzig, J. (2017). Computational Pipelines and Workflows in Bioinformatics. In Reference Module in Life Sciences (Elsevier, 2018). [Book Chapter]

Greenberg, J. (2017). Big metadata, smart metadata, and metadata capital: Toward greater synergy between data science and metadata. Journal of Data and Information Science, 2(3): 19-36. [Paper]

Grabus, S., & Greenberg, J. (2017). Toward a Metadata Framework for Sharing Sensitive and Closed Data: An Analysis of Data Sharing Agreement Attributes. In E. Garoufallou, S. Virkus, R. Siatri, & D. Koutsomiha (Eds.), Metadata and Semantic Research: 11th International Conference, MTSR 2017, Tallinn, Estonia, November 28 — December 1, 2017, Proceedings (pp. 300–311). Cham: Springer International Publishing. [Paper] [Slides]

Opalek, A., & Greenberg, J. (2017). The Representation of Agents as Resources for the Purpose of Professional Regulation and Global Health Workforce Planning. In E. Garoufallou, S. Virkus, R. Siatri, & D. Koutsomiha (Eds.), Metadata and Semantic Research: 11th International Conference, MTSR 2017, Tallinn, Estonia, November 28 — December 1, 2017, Proceedings (pp. 103–111). Cham: Springer International Publishing. [Paper]

Li, K., Rollins, J., & Yan, E. (2017). Web of Science use in published research and review papers 1997–2017: a selective, dynamic, cross-domain, content-based analysis. Scientometrics, 1–20. [Paper]

Greenberg, J. (2017). BIG DATA: Balancing Impacts, Investments and Education. 2017 ESS/SAES/ARD Fall Meeting: A Question of Balance Workshop, Philadelphia, PA, September 26, 2017. [Slides]

Li, K. & Xu, S. (2017). Measuring the impact of R packages. Presented at the 80th Annual Meeting of the Association for Information Science & Technology (ASIS&T). Washington, DC, October 27 – November 1, 2017.

Song, I. and Zhu, Y. (2017). Big Data and Data Science: Opportunities and Challenges of iSchools. Journal of Data and Information Science, 2(3): 1-18. [Paper]

Greenberg, J., Grabus, S., and Liu, H. (2017). Semantic Analysis and Attribute Clustering: Developing a Data Sharing Agreement Ontology. 11th U.S. Networked Knowledge Organization Systems (NKOS) Workshop. DC-2017. International Conference on Dublin Core and Metadata Applications, Washington, D.C. [Abstract]

Park, J. & Tosaka, Y. (2017). Cataloging and Metadata Professionals’ Experiences and Perspectives on Issues Surrounding Continuing Education. Cataloging and Classification Quarterly 55 (3): 153-171.

Park, J. & Tosaka, Y. (2017). Emerging information standards and technologies: cataloging and metadata professionals’ perspectives. Library Hi Tech News 34 (4): 22-26.

Poole, A. (2017). “A Greatly Unexplored Area”: Digital Curation and Innovation in Digital Humanities. Journal of the Association for Information Science and Technology, 68.

Poole, A. H. (2017). The conceptual ecology of digital humanities. Journal of Documentation, 73(1), 91-122.

Li, K. (2017). An ontology of data events based on GBIF data papers preliminary findings. Poster presented at the Research Data Alliance 10th Plenary Meeting. Montreal, Québec, Canada. [Poster]

Grabus, S. (2017). Advancing rights management metadata best practices across open and closed data sharing communities. Poster presented at the Research Data Alliance 10th Plenary Meeting. Montreal, Québec, Canada. [Poster]

Poole, A. H. (2017). “A greatly unexplored area”: Digital curation and innovation in digital humanities. Journal of the Association for Information Science and Technology. [Paper]

Greenberg, J., Grabus, S., Hudson, F., Kraska, T., Madden, S., Bastón, R., & Naum, K. (2017). The Northeast Big Data Hub: “Enabling Seamless Data Sharing in Industry and Academia” Workshop. Philadelphia, PA: The Northeast Big Data Innovation Hub. [Report]


Duarte, K., Weber, R., & Pacheco, R. C. (2016). Purpose-oriented metrics to assess researcher quality. [Paper]

Greenberg, J., Lin, X., Li, K., & Gong, X. (2016). Transforming Data Adaptation Science and Service: An Innovative Visual Ontology Application: CVDI Year 4 Final Project Reports 2015-2016. [Report]

Li, K., Greenberg, J., & Lin, X. (2016). Software Citation, Reuse and Metadata Considerations: An Exploratory Study Examining LAMMPS. In Proceedings of the 79th ASIS&T Annual Meeting (Vol. 53). [Paper] [Slides]

Leipzig, J. (2016). A review of bioinformatic pipeline frameworks. Briefings in bioinformatics, bbw020. [Paper]

Poole, A. H. (2016). The conceptual landscape of digital curation. Journal of Documentation, 72(5), 961-986.


Krause, E. M., Clary, E., Ogletree, A., Greenberg, J. (2015). Data from: Evolution of an application profile: Advancing metadata best practices through the Dryad data repository. Dryad Digital Repository. [Data]

Greenberg, J., Simmons, I., Ogletree, A., Kavanaugh, C., Goethals, A. (2015). 2D and 3D Format Selection and Metadata Analysis: Final Report. Harvard Library. [Report]

Greenberg, J. (2015). Philosophical Foundations and Motivation via Scientific Inquiry. In Richard Smiraglia and Hur-Li Lee (Eds.), Ontology in Knowledge Organization. Germany: Ergon-Verlag GmbH, 5-12.

Greenberg, J., Zhang, Y., Ogletree, A., Tucker, G. J., & Foley, D. (2015). Threshold determination and engaging materials scientists in ontology design. In Metadata and Semantics Research: 9th Research Conference, MTSR 2015, Manchester, UK, September 9-11, 2015, Proceedings 9 (pp. 39-50). Springer International Publishing.

Zhang, Y., Greenberg, J., Ogletree, A., Tucker, G. (2015). Advancing Materials Science Semantic Metadata via HIVE. In DC-2015: Metadata and Ubiquitous Access to Culture, Science, and Digital Humanities: Proceedings of the International Conference on Dublin Core and Metadata Applications. São Paulo, Brazil, September 1-5, 2015. [Poster]

Krause, E., Clary, E., Ogletree, A., Greenberg, J. (2015). Evolution of an Application Profile: Advancing Metadata Best Practices through the Dryad Data Repository. In DC-2015: Metadata and Ubiquitous Access to Culture, Science, and Digital Humanities: Proceedings of the International Conference on Dublin Core and Metadata Applications. São Paulo, Brazil, September 1-5, 2015. [Paper]

Ogletree, A., Koskela, R., Jeffery, K., Ball, A., Dublin, D., Greenberg, J., Berman, F. (2015). RDA Overview and Metadata Activities. In DC-2015: Metadata and Ubiquitous Access to Culture, Science, and Digital Humanities: Proceedings of the International Conference on Dublin Core and Metadata Applications. São Paulo, Brazil, September 1-5, 2015. [Presentation]

Zhang, Y., Ogletree, A., Greenberg, J., & Rowell, C. (2015). Controlled vocabularies for scientific data: users and desired functionalities. Proceedings of the Association for Information Science and Technology, 52(1), 1-8.

Rowell, C., Greenberg, J., Zhang, Y., Ogletree, A. (2015). Controlled Vocabularies for Scientific Data: Users and Desired Functionalities. figshare. [Data]

Greenberg, J. (2015). Metadata Capital: Conceptual Understanding, Predictive Value. Presented at Hollywood IT Summit, Los Angeles, CA, May 14, 2015. [Presentation]

Krause, E. M., Clary, E., Ogletree, A., Greenberg, J. (2015). Evolution of a Metadata Application Profile for a Digital Data Repository. Presented at Drexel Research Day, Philadelphia, PA, May 1, 2015. [Poster]

Greenberg, J. (2015). Metadata Capital: Conceptual Understanding, Predictive Value. Presented at MESA – Metadata Madness. New York, NY, March 31, 2015. [Presentation]

Greenberg, J., Simmons, I., Zhang, Y. (2015). Drexel Team Update: DataNet Federation Consortium. March 3, 2015, RENCI, North Carolina. [Presentation]


Greenberg, J., Murillo, A. P., Ogletree, A., Boyles, R., Martin, N., & Romeo, C. (2014). Metadata Capital: Automating Metadata Workflows in the NIEHS Viral Vector Core Laboratory. In MTSR-2014: Proceedings of the 8th Metadata and Semantics Research Conference. Karlsruhe, Germany, November 27-29, 2014, pp. 1-13. [Paper]

Thompson, C. A., Robertson, W. D., & Greenberg, J. (2014). Where Have All the Scientific Data Gone? LIS Perspective on the Data-At-Risk Predicament. College & Research Libraries, 75(6): 842-861. [Paper]

Greenberg, J., Ogletree, A., Murillo, A. P., Caruso, T. P., & Huang, H. (2014). Metadata Capital: Simulating the Predictive Value of Self-Generated Health Information (SGHI). In 2014 IEEE International Conference on Big Data. Washington, DC, October 27-30, 2014, pp. 31-36. [Paper] [Presentation]

Maron, D., Missen, C., & Greenberg, J. (2014). “Lo-Fi to Hi-Fi”: A New Metadata Approach in the Third World with the eGranary Digital Library. In DC-2014: Metadata Intersections: Bridging the Archipelago of Cultural Memory: Proceedings of the International Conference on Dublin Core and Metadata Applications. Austin, Texas, October 8-11, 2014, pp. 37-42. [Project Report]

Ogletree, A. (2014). Metadata Workflows Across Research Domains: Challenges and Opportunities for Supporting the DFC Cyberinfrastructure. In DC-2014: Metadata Intersections: Bridging the Archipelago of Cultural Memory: Proceedings of the International Conference on Dublin Core and Metadata Applications. Austin, Texas, October 8-11, 2014, pp. 184-186. [Poster]

Ling, Y., Greenberg, J., & Koskela, R. (2014). Mapping Human-Readable/Machine-Readable Policies for RDA Metadata Standards Directory Development. Poster presented at the Research Data Alliance Fourth Plenary Meeting. Amsterdam, The Netherlands. [Poster]

Mannheimer, S., Yoon, A., Greenberg, J., Feinstein, E., & Scherle, R. (2014). A Balancing Act: The Ideal and the Realistic in Developing Dryad’s Preservation Policy. First Monday. doi:10.5210/fm.v19i8.5415 [Paper]

Greenberg, J. (2014). Metadata Capital: Raising Awareness, Exploring a New Concept. Bulletin of the Association for Information Science and Technology, 40(4): 30-33. [Bulletin]

White, H., Willis, C., & Greenberg, J. (2014). HIVEing: The Effect of a Semantic Web Technology on Inter-Indexer Consistency. Journal of Documentation, 70(3): 307-329. [Paper]

Ogletree, A., Huang, H., Mathews, A. C. (2014). Quantifying the Value of Metadata Capital for Data Science. Presented at NCDS Showcase 2014. Chapel Hill, NC, May 21, 2014. [Poster]

Murillo, A. P. (2014). Data Sharing and Reuse in the Sciences: An Investigation of Selected Cyberinfrastructure and Interoperability Elements. Bulletin of IEEE Technical Committee Digital Libraries: 1-7. [Paper]

Ball, A. J., Chen, S., Greenberg, J., Perez, C., Jeffery, K. & Koskela, R. (2014). Building a Disciplinary Metadata Standards Directory. 9th International Digital Curation Conference. [Paper] [Presentation]

Greenberg, J. (2014). Metadata Capital via a Linked Data HIVE. Advances in Classification Research Online, 24(1): 59-61. [Paper]


Greenberg, J., Rodriguez, E. M., & de la Fuente, G. B. (Eds.). (2013). Special Issue: Linking Open Vocabularies. Library Hi Tech, 31(4): 569-574. [Editorial]

Greenberg, J., & Garoufallou, E. (2013). Change and a Future for Metadata. In MTSR-2013: Proceedings of the 7th Metadata and Semantics Research Conference. Thessaloniki, Greece, November 19-22, 2013, pp. 1-5. [Paper]

Conway, M. C., Greenberg, J., Moore, R., Whitton, M., & Zhang, L. (2013). Advancing the DFC Semantic Technology Platform via HIVE Innovation. Proceedings of the 7th Metadata and Semantics Research Conference. Thessaloniki, Greece, November 19-22, 2013, pp. 14-21. [Paper]

Mannheimer, S., & Yoon, A. (2013). Developing Preservation Policy for Dryad Digital Repository. SIG-DL Digital Liaisons Panel, Association for Information Science & Technology (ASIS&T) Annual Meeting. Montreal, Canada. [Poster]

Earls, A. C., Clary, E., Greenberg, J., Kirschenfeld, A., Murillo, A. P., Robertson, W. D., Swauger, S., & Anderson, W. L. (2013). The Data-at-Risk Initiative: A Metadata Scheme for Documenting Data Rescue Activities: 1-3. In iPRES 2013. Lisbon, Portugal. [Paper]

Greenberg, J., Swauger, S., & Feinstein, E. M. (2013). Metadata Capital in a Data Repository. In DC-2013: Proceedings of the International Conference on Dublin Core and Metadata Applications. Lisbon, Portugal, September 2-6, 2013, pp. 140-150. [Paper]

Greenberg, J., Murillo, A., Kunze, J., Callaghan, S., Guralnick, R., Nassar, N., Ram, K., Janee, G., & Patton, C. (2013). Metadictionary: Advocating for a Community-driven Metadata Vocabulary Application. In DC-2013: CAMP-4-DATA Workshop: Proceedings of the International Conference on Dublin Core and Metadata Applications. Lisbon, Portugal, September 2-6, 2013. [Paper]

Greenberg, J., Swauger, S., & Feinstein, E. M. (2013). Data from: Metadata capital in a data repository. Dryad Digital Repository. [Data]

Thompson, C. A., Robertson, W. D., & Greenberg, J. (2013). Where Have All the Scientific Data Gone? LIS Perspective on the Data-At-Risk Initiative. In IDCC13: 8th International Digital Curation Conference. Amsterdam, The Netherlands, January 14-17, 2013. [Poster]

Veitch, M., Greenberg, J., Keizer, C., & Gunther, W. (2013). The UNC–Chapel Hill RDA Boot Camp: Preparing LIS Students for Emerging Topics in Cataloging and Metadata. Cataloging & Classification Quarterly, 51(4): 343-364. [Paper]

Greenberg, J., Rowell, C., Rajavi, K., Conway, M., & Lander, H. (2013). HIVEing Across U.S. DataNets. Research Data Management Implementations Workshop, NSF/Coalition for Academic Scientific Computation (CASC), Arlington, VA, March 13-15, 2013. [Paper]


Greenberg, J., Murillo, A. P., & Kunze, J. A. (2012). Ontological Empowerment: Sustainability via Ownership. 23rd ASIS&T SIG/CR Classification Research Workshop. Baltimore, MD, October 26, 2012, pp. 1-3. [Paper]

Greenberg, J., Trujillo, S., & Mayer-Patel, K. (2012). YouTube: Applying FRBR and Exploring the Multiple Description Coding Compression Model. Cataloging & Classification Quarterly, 50(5-7): 742-762. [Paper]

Méndez, E., & Greenberg, J. (2012). Linked Data for Open vocabularies and HIVE’s Global Framework. El Profesional de la Información, 21(3): 236-244. [Paper] Also published in Spanish as: Datos enlazados para vocabularios abiertos: marco global de HIVE.

Qin, J., Ball, A., & Greenberg, J. (2012). Functional and Architectural Requirements for Metadata: Supporting Discovery and Management of Scientific Data. In DC-2012: Proceedings of the International Conference on Dublin Core and Metadata Applications. Kuching, Sarawak, Malaysia, September 3-7, 2012, pp. 62-71. [Paper]

Murillo, A. P., Carver, N., Greenberg, J., Robertson, W. D., Thompson, C. A., & Anderson, W. L. (2012). Data-At-Risk Initiative: Scientists’ Perceptions of Endangered Data and Data Reuse. 23rd CODATA International Conference (Committee on Data for Science and Technology). Taipei, Taiwan, October 28-31, 2012, pp. 1-6. [Abstract]

Murillo, A. P., Thompson, C. A., Carver, N., Robertson, W. D., Greenberg, J., & Anderson, W. L. (2012). The Data-At-Risk Initiative: Analyzing the Current State of Endangered Scientific Data. American Society of Information Science & Technology Annual Conference. Baltimore, MD, October 26-30, 2012, pp. 1-3. [Paper]

Thompson, C. A., Robertson, W. D., Carver, N., Murillo, A. P., Greenberg, J., & Anderson, W. L. (2012). Scientific Data At Risk: Understanding the Predicament and the Role of Special Librarians. Special Libraries Association Annual Conference. Chicago, IL, July 15-18, 2012. [Poster]

Carver, N., Murillo, A. P., Thompson, C., Anderson, B., Greenberg, J., & Robertson, D. W. (2012). Data at Risk Initiative: A Study of Endangered Scientific Data. International Council on Archives Congress. Brisbane, Australia, August 20-24, 2012.

Willis, C., Greenberg, J., & White, H. (2012). Analysis and Synthesis of Metadata Goals for Scientific Data. Journal of the American Society for Information Science and Technology, 63(8): 1505-1520. [Paper]

Fitzgerald, R., Dechman, L., & Willis, C. (2012). HIVE for LC Web Archives: Web Archives and Automatic Subject Indexing. Presentation given at the International Internet Preservation Consortium 2012 General Assembly. Washington, DC, April 30-May 4, 2012, pp. 1-12. [Presentation]

White, H., Willis, C., & Greenberg, J. (2012). The HIVE impact: Contributing to Consistency via Automatic Indexing. In iConference ’12: Proceedings of 2012 iConference. Toronto, ON, Canada, February 7-10, 2012, pp. 1-3. [Poster]


Greenberg, J., Losee, R., Pérez Agüera, J. R., Scherle, R., White, H., & Willis, C. (2011). HIVE: Helping Interdisciplinary Vocabulary Engineering. Bulletin of the American Society for Information Science and Technology, 37(4): 23-26. [Paper]

Anderson, W., Faundeen, J., Greenberg, J., & Taylor, F. (2011). Metadata for Data Rescue and Data at Risk. In PV2011: Ensuring Long-Term Preservation and Adding Value to Scientific and Technical Data. Toulouse, France, November 15-17, 2011, pp. 1-6. [Paper] [Presentation]

Thompson, C. A., Carver, N., Collins, K., Sinclair, J., & Veitch, J. M. (2011). Supporting Scientists in Data Archiving: Emerging Roles for Information Professionals. Digital Liaisons: Student Perspectives on Curating the Information Life Cycle: American Society of Information Science & Technology Annual Conference. New Orleans, LA. [Poster]


Greenberg, J. (2010). Metadata for Scientific Data: Historical Considerations, Current Practices, and Prospects. Journal of Library Metadata, 10(2-3): 75-78. [Paper]

Greenberg, J. (2010). Metadata and Digital Information. In Encyclopedia of Library and Information Science, Third Edition, 3610-3623. New York: Marcel Dekker, Inc. [Paper]

Greenberg, J., Deshmukh, R., Huang, L., Mostafa, J., La Vange, L., Carretta, E., & O’Neal, W. (2010). The COPD Ontology and Toward Empowering Clinical Scientists as Ontology Engineers. Journal of Library Metadata, 10(2-3), 173-187. [Paper]

Deshmukh, R., Huang, L., Mostafa, J., & Greenberg, J. (2010). SPIRO-V: A Collaborative Approach to Controlled Vocabularies Gathering and Management. Joint International Conferences on Digital Libraries. Gold Coast, Australia, June 21-25, 2010, pp. 371-372. [Paper]

Silbajoris, C., Greenberg, J., Nassar, N., & Shoffner, M. (2010). Go Local Goes Faster: Expediting the Indexing and Auditing Processes. Medical Library Association Annual Conference. Washington, DC, May 21-26, 2010. [Presentation]

Pérez-Agüera, J. R., Arroyo, J., Greenberg, J., Perez-Iglesias, J., & Fresno, V. (2010). Using BM25F for Semantic Search. In WWW 2010: Proceedings of the 19th International Conference on the World Wide Web. Raleigh, NC, April 26-30, 2010, pp. 1-8. [Paper] [Presentation] (SemSearch2010 Best Paper Award Winner)

Pérez-Agüera, J. R., Arroyo, J., Greenberg, J., Perez-Iglesias, J., & Fresno, V. (2010). INEX+DBPEDIA: A Corpus for Semantic Search Engine. In WWW 2010: Proceedings of the 19th International Conference on the World Wide Web. Raleigh, NC, April 26-30, 2010, pp. 1161-1162. [Paper]

Greenberg, J. (2010). The Dryad Repository: A Metadata Best Practice for a Scientific Data Repository. ASIST Research Data Access and Preservation Summit. Phoenix, Arizona, April 9-10, 2010. [Presentation]

Greenberg, J., Daniel, E., Edwards, P., Gollop, C., Kramer-Duffield, J., Woodbury, D., Seiberling, S., Weakley, A., & Shoffner, M. (2010). Cultivating Collaboration and Converging Disciplines via the Bot2.0 Initiative. ALISE 2010 Annual Conference. Boston, MA, January 12-15, 2010.

Kramer-Duffield, J., & Greenberg, J. (2010). Linking Semantic Spaces: Learning Plant Identification in the Digital World of Nature. ALISE 2010 Annual Conference. Boston, MA, January 12-15, 2010.

Scherle, R., & Aguera, J. (2010). HIVE: A New Tool for Working With Vocabularies. Presented at Code4Lib 2010. Asheville, North Carolina, February 22-25, 2010.


Greenberg, J., White, H., Carrier, C., & Scherle, R. (2009). A Metadata Best Practice for a Scientific Data Repository. Journal of Library Metadata, 9(3), 194-212. [Paper] (Exemplary paper; selected by journal’s chief editor for free access)

Greenberg, J. (2009). Theoretical Considerations of Lifecycle Modeling: An Analysis of the Dryad Repository Demonstrating Automatic Metadata Propagation, Inheritance, and Value System Adoption. Cataloging & Classification Quarterly, 47(3), 380-402. [Paper] (Jesse H. Shera Award for Distinguished Published Research)

Greenberg, J., Lapp, H., Scherle, R., Vision, T., White, H., Carrier, S., & Schaeffer, P. (2009). The Dryad Repository: Designing a Curation Workflow. 5th International Digital Curation Conference. London, UK, December 2-4, 2009.

Edwards, P. M., Daniel, E., Greenberg, J., Kramer-Duffield, J., Taylor, H., Woodbury, D., Seiberling, S., Weakley, A., & Shoffner, M. (2009). Evaluating Technology, Information Literacy, and Content-Related Outcomes Among Undergraduate Students in Face-to-Face and Social Networking Environments. Proceedings of the ASIST 2009 Annual Conference. Thriving on Diversity – Information Opportunities in a Pluralistic World. Vancouver, BC, November 8-11, 2009. [Poster]

Kramer-Duffield, J., & Greenberg, J. (2009). From Novice to Expert: Student Use of Folksonomy and Key-Based Information Resources in Botany. Proceedings of the ASIST 2009 Annual Conference. Thriving on Diversity – Information Opportunities in a Pluralistic World. Vancouver, BC, November 8-11, 2009.

Scherle, R., Bapat, A., Carrier, S., Greenberg, J., Lapp, H., Schaeffer, P., Vision, T., & White, H. (2009). Dryad: A Digital Repository of Data From Publications in Evolutionary Biology and Ecology. Evolution 2009. Moscow, Idaho, June 12-16, 2009.

Scherle, R., Lapp, H., Bapat, A., Carrier, S., Greenberg, J., Schaeffer, P., Vision, T., & White, H. (2009). The Dryad Data Repository. e-Biosphere ’09: International Conference on Biodiversity Informatics. London, UK, June 1-3, 2009.

Scherle, R. (2009). Using DSpace as a Disciplinary Data Repository. Presented at the 4th International Conference on Open Repositories. Atlanta, Georgia, May 18-21, 2009. [Presentation]

Greenberg, J. (2009). Metadata Research Supporting the Dryad Data Repository. Cornell University Library Metadata Working Group Forum, April 17, 2009. [Presentation]

Greenberg, J., Carrier, S., White, H., & Scherle, R. (2009). The Dryad Repository Application Profile: Groundwork Towards a Metadata Scheme for Scientific Data. DigCCurr 2009: Digital Curation Practice, Promise and Prospects. Chapel Hill, NC, April 1-3, 2009.

Greenberg, J. (2009). Theories of Evolution and Cultural Diffusion: The Dryad Repository Case Study for Understanding Changes in Organizing Information Practices. iSociety: Research, Education, Engagement. 2009 iConference. Chapel Hill, NC, February 8-11, 2009. [Paper]

Greenberg, J. (2009). The Semantic Web: Fact or Myth. CENDI, FLICC, and NFAIS Workshop. National Archives, Washington, DC, November 17, 2009. [Presentation]

Greenberg, J. (2009). Interoperable Thesauri: The Challenges and Experiences of the HIVE Project. CENDI/NKOS Workshop. National Agricultural Library, Beltsville, MD, October 22, 2009. [Presentation]


White, H. C. (2008). Exploring Evolutionary Biologists’ Use and Perceptions of Semantic Metadata for Data Curation. In DC-2008: Metadata for Semantic and Social Applications: Proceedings of the International Conference on Dublin Core and Metadata Applications. Berlin, Germany, September 22-26, 2008, p. 202. [Poster]

Shoffner, M., Greenberg, J., Kramer-Duffield, J., & Woodbury, D. (2008). Web 2.0 Semantic Systems: Collaborative Learning in Science. In DC-2008: Metadata for Semantic and Social Applications: Proceedings of the International Conference on Dublin Core and Metadata Applications. Berlin, Germany, September 22-26, 2008, pp. 209-210. [Poster]

White, H., Carrier, S., Thompson, H., Greenberg, J., & Scherle, R. (2008). The Dryad Data Repository: A Singapore Framework Metadata Architecture in a DSpace Environment. In DC-2008: Metadata for Semantic and Social Applications: Proceedings of the International Conference on Dublin Core and Metadata Applications. Berlin, Germany, September 22-26, 2008, pp. 157-162. [Paper]

Scherle, R., Carrier, S., Greenberg, J., Lapp, H., Thompson, A., Vision, T., & White, H. (2008). Building Support for a Discipline-Based Data Repository. Proceedings of the 2008 International Conference on Open Repositories. Southampton, UK, April 1-4, 2008, pp. 1-2. [Poster] [Paper]

Bueno de la Fuente, G. (2008). The simple knowledge organization system (SKOS): A situation report for the HIVE project. [Paper]


Greenberg, J., & Méndez, E. (Eds.). (2007). Knitting the Semantic Web. New York: Haworth Information Press. (Monograph will also be simultaneously published as Cataloging & Classification Quarterly, 43(3/4): 1-2) [Paper]

Greenberg, J. (2007). Advancing the Semantic Web via Library Functions. Cataloging & Classification Quarterly, 43(3/4): 203-225. doi:10.1300/J104v43n03_11 (Article will also be simultaneously published as a chapter in J. Greenberg & E. Méndez (Eds.). Knitting the Semantic Web. New York: Haworth Information Press.) [Paper] Translated into Serbian: Grinberg, D. (2007). Unapređenje Semantičkog Veba Pomoću Bibliotečkih Funkcija. Glasnik Narodne Biblioteke Srbije (Herald of the National Library of Serbia), 1: 79-95. Available online here

Carrier, S., Dube, J., Greenberg, J., Lapp, H., Thompson, A., Vision, T., & White, H. (2007). The DRYAD Repository: Transforming Scientific Publishing and Data Discovery via the Convergence of Open Access and eScience. Poster session presented at the 2007 Microsoft eScience Workshop at RENCI, October 21-23, 2007, Chapel Hill, NC. [Poster]

Severiens, T., & Greenberg, J. (2007). The DCMI Tools Application Profile. In DC-2007: Application Profiles: Theory and Practice: Proceedings of the International Conference on Dublin Core and Metadata Applications. August 27-31, 2007, Singapore, pp. 30-34. [Paper]

Carrier, S., Dube, J., & Greenberg, J. (2007). The DRIADE Project: Phased Application Profile Development in Support of Open Science. In DC-2007: Application Profiles: Theory and Practice: Proceedings of the International Conference on Dublin Core and Metadata Applications. August 27-31, 2007, Singapore, pp. 35-42. [Paper]

Dube, J., Carrier, S., & Greenberg, J. (2007). DRIADE: A Data Repository for Evolutionary Biology. In JCDL2007: Proceedings of the 2007 Conference on Digital Libraries. Vancouver, BC, June 18-23, 2007, p. 481. [Poster] [Paper]

Greenberg, J., & Severiens, T. (2007). DCMI-Tools: Ontologies for Digital Application Description. In L. Chan and B. Martens (Eds.). ELPUB2007. Openness in Digital Publishing: Awareness, Discovery and Access – Proceedings of the 11th International Conference on Electronic Publishing, Vienna, Austria, June 13-15, 2007, pp. 437-444. [Paper]

Vision, T., Greenberg, J., & Lapp, H. (2007). Data Preservation, Sharing, and Discovery: Challenges for Small Science in the Digital Era. NSF, NESCent, and MRC funded workshop. NESCent, Durham, NC, May 16-17, 2007.


Greenberg, J., Méndez, E., Crystal, A., Sharma, A., Oberlin, J., & Shoffner, M. (2006). Memex Metadata (M2) for Reflective Learning. In DC-2006: Metadata for Knowledge and Learning: Proceedings of the International Conference on Dublin Core and Metadata Applications. Manzanillo, Colima, Mexico, October 3-6, 2006, pp. 169-189. Presented by: Jane Greenberg. [Paper] [Presentation]

Crystal, A., & Greenberg, J. (2006). Relevance Criteria Identified by Health Information Users During Web Searches. Journal of the American Society for Information Science & Technology, 57(10), 1368-1382. [Paper]

Méndez, E., & Greenberg, J. (2006). Metadata and Ontologies for Organizing Students’ Memories and Learning: Standards and Convergence Models for Content Awareness. In InSciT2006: International Conference on Multidisciplinary Information Sciences and Technologies. Mérida, Spain, October 25-28, 2006, pp. 309-314. Presented by: Eva Méndez. [Paper]

Barreau, D., Crystal, A., Greenberg, J., Sharma, A., Conway, M., Oberlin, J., Shoffner, M., Seiberling, S., Bailey, E., & Baldwin, T. (2006). Augmenting Memory for Student Learning: Designing a Context-Aware Capture System for Biology Education. In ASIST 2006 Annual Meeting: Information Realities: Shaping the Digital Future for All. Austin, Texas, November 3-8, 2006. Presented by: Deborah Barreau and Eva Méndez.

Crystal, A. (2006). Memex Personal Information Management System (MyLifeBits). Presented at the SILS CRADLE (Center for Research and Development of Digital Libraries) Brown Bag Lunch Series, September 15, 2006.

Barreau, D., Crystal, A., Greenberg, J., & Sharma, A. (2006). Personal Information Management in Context. In Now That We’re Talking, What Are We Learning? Personal Information Management in Context Workshop, 29th Annual International ACM SIGIR Conference. Seattle, Washington, August 10-11, 2006, pp. 6-7. Presented by: Deborah Barreau. [Paper] [Poster]

Greenberg, J. (2006). Memex Metadata (M2) for Personal Educational Portfolio. Presented at the MS Research Faculty Summit, Memex Day, July 19, 2006. [Presentation]

Greenberg, J., & Severiens, T. (2006). Metadata Tools for Digital Resource Repositories: JCDL 2006 Workshop Report. D-Lib Magazine, 12(7/8). [Paper]

Greenberg, J., Spurgin, K., & Crystal, A. (2006). Functionalities for Automatic Metadata Generation Applications: A Survey of Metadata Experts’ Opinions. International Journal of Metadata, Semantics, and Ontologies, 1(1), 3-20. [Paper]


Greenberg, J. (2005). Understanding Metadata and Metadata Schemes. Cataloging & Classification Quarterly, 40(3/4), 17-36. doi:10.1300/J104v40n03_02 (Article also published as a chapter in R. P. Smiraglia (Ed.) Metadata: A Cataloger’s Primer. New York: Haworth Information Press.) [Paper]

Greenberg, J., Heidorn, B., Seiberling, S., & Weakley, A. (2005). Growing Vocabularies for Plant Identification and Scientific Learning. In DC-2005: Vocabularies in Practice: Proceedings of the International Conference on Dublin Core and Metadata Applications. Madrid, Spain, September 12-15, 2005, pp. 99-110. (Shorter 2006 version also published in Bulletin of the American Society for Information Science & Technology, 32(5), 17-19.) [Paper] Translated into Chinese: Greenberg, J.; Heidorn, B. Seiberling, S. and Weakley, A. S. (2006). 为植物鉴定和学习科学建立词表. (Growing Vocabularies for Plant Identification and Scientific Learning). New Technology of Library and Information Service, 1: 33-43. Available online here

Crystal, A., & Greenberg, J. (2005). Usability of a Metadata Creation Application for Resource Authors. Library and Information Science Research, 27(2), 177-189. [Paper]

Greenberg, J., Spurgin, K., & Crystal, A. (2005). Final Report for the AMeGA (Automatic Metadata Generation Applications) Project. Submitted to the Library of Congress, February 17, 2005. [Report]


Greenberg, J. (2004). Metadata Extraction and Harvesting: A Comparison of Two Automatic Metadata Generation Applications. Journal of Internet Cataloging, 6(4), 59-82. [Paper]

Robertson, W. D., & Greenberg, J. (2004). Architecting a Cross-Disciplinary Thesaurus for the Semantic Web. In DC-2004: Metadata Across Languages and Cultures: Proceedings of the International Conference on Dublin Core and Metadata Applications. Shanghai, China, October 11-14, 2004, pp. 231-235. [Paper]

Greenberg, J. (2004). User Comprehension and Searching with Information Retrieval Thesauri. Cataloging & Classification Quarterly, 37(3/4), 103-130. (Article also published as a chapter in S. Roe & A. R. Thomas (Eds.). The Thesaurus: Review, Renaissance and Revision. New York: Haworth Information Press.) [Paper]


Greenberg, J., Crystal, A., Robertson, W. D., & Leadem, E. (2003). Iterative Design of Metadata Creation Tools for Resource Authors. In DC-2003: Supporting Communities of Discourse and Practice: Proceedings of the International Conference on Dublin Core and Metadata Applications. Seattle, Washington, September 28-October 2, 2003, pp. 49-58. [Paper]

Greenberg, J. (2003). Metadata and the World Wide Web. Encyclopedia of Library and Information Science, 1876-1888. New York: Marcel Dekker, Inc. (2002 version published in Encyclopedia of Library and Information Science, 72 (Suppl. 35), 244-261.) [Paper]

Greenberg, J. (2003). The Semantic Web: More Than a Vision. Bulletin of the American Society for Information Science & Technology, 29(4), 6-7. [Paper]

Greenberg, J., Sutton, S., & Campbell, G. D. (2003). Metadata: A Fundamental Component of the Semantic Web. Bulletin of the American Society for Information Science & Technology, 29(4), 16-18. [Paper]

Greenberg, J. (2003). Metadata Generation: Processes, People, and Tools. Bulletin of the American Society for Information Science & Technology, 29(2), 18-21. [Paper]

Irvin, K. (2003). Comparing Information Retrieval Effectiveness of Different Metadata Generation Methods. Chapel Hill, North Carolina: School of Information and Library Science, UNC-Chapel Hill. [Master’s Thesis]

Newby, G., Greenberg, J., & Jones, P. (2003). Open Source Software Development and Lotka’s Law: Bibliometric Patterns in Programming. Journal of the American Society for Information Science and Technology, 54(2): 169-178. [Paper]


Greenberg, J., & Robertson, W. D. (2002). Semantic Web Construction: An Inquiry of Authors’ Views on Collaborative Metadata Generation. In DC-2002: Metadata for e-Communities: Supporting Diversity and Convergence: Proceedings of the International Conference on Dublin Core and Metadata Applications. Florence, Italy, October 13-17, 2002, pp. 45-52. [Paper]

Harper, C., Greenberg, J., Robertson, W. D., & Leadem, E. (2002). Abstraction versus Implementation: Issues in Formalizing the NIEHS Application Profile. In DC-2002: Metadata for e-Communities: Supporting Diversity and Convergence: Proceedings of the International Conference on Dublin Core and Metadata Applications. Florence, Italy, October 13-17, 2002, pp. 213-215. [Poster]

Greenberg, J., Pattuelli, M. C., Parsia, B., & Robertson, W. D. (2002). Author-Generated Dublin Core Metadata for Web Resources: A Baseline Study in an Organization. Journal of Digital Information, 2(2). [Paper] (Earlier version published in DC-2001: Proceedings of the International Conference on Dublin Core and Metadata Applications. Tokyo, Japan, October 22-26, 2001, pp. 38-46. [Paper])

Greenberg, J., Bullard, K., James, M. L., Daniel, E., & White, P. (2002). Student Comprehension of Classification Applications in a Science Education Digital Library. Research and Advanced Technology for Digital Libraries, 6th European Conference, ECDL 2002, Rome, Italy, September 16-18, Proceedings. Lecture Notes in Computer Science, pp. 560-567. [Paper]

Dempsey, B., Weiss, D., Jones, P., & Greenberg, J. (2002). Who is an Open Source Developer? A Quantitative Profile of a Community of Open Source Linux Developers. Communications of the ACM, 45(2): 67-72. [Paper]


Robertson, W. D., Leadem, E. M., Dube, J., & Greenberg, J. (2001). Design and Implementation of the National Institute of Environmental Health Sciences Dublin Core Metadata Schema. In DC-2001: Proceedings of the International Conference on Dublin Core and Metadata Applications. Tokyo, Japan, October 22-26, 2001, pp. 193-199. [Paper]

Greenberg, J. (2001). A Quantitative Categorical Analysis of Metadata Elements in Image Applicable Metadata Schemas. Journal of the American Society for Information Science and Technology, 52(11): 917-924. [Paper]

Greenberg, J. (2001). Optimal Query Expansion (QE) Processing Methods with Semantically Encoded Structured Thesauri Terminology. Journal of the American Society for Information Science and Technology, 52(6): 487-498. [Paper]

Greenberg, J. (2001). Metadata Applications for the Plant Information Center (PIC): A Web-based Scientific Learning Center. Interactive Learning Environments, 9(3): 291-313. [Paper]

Greenberg, J. (2001). A Comparison of Web Resource Access Experiments: Planning for the New Millennium. In Proceedings of the Bicentennial Conference on Bibliographic Control for the New Millennium: Confronting the Challenges of Networked Resources and the Web. Library of Congress, Washington, DC, November 15-17, 2000 (ISBN 0-8444-1046-2), pp. 343-355. [Paper]