April 2024 newsletter

Aotearoa New Zealand National Committee on Data in Research (CoDiR) #

Report from the Committee to the CoDiR Forum
April 2024

Summary #

Key CoDiR news included in this report:

Project Work #

Data Landscape Review #

The project is complete. The full report can be found at: https://doi.org/10.15663/UoW.RDLA.DEC2023, and was shared/discussed at meetings with the Chief Science Advisors' Forum, on 31 Jan, and MBIE on 29 Feb.

The report resonated with some other discussions the CSA Forum had been having around the uses of geospatial data, particularly related to disaster response work. They were positive about the report, but pointed out that while it is useful to policy people, decision makers would need more focussed and engaging material. It was suggested that there should be a series of short case studies developed that would show the benefits of enhancing data work in NZ. Ideas are being developed in this space but resource constraints are slowing further work in this space.

Roughly 130 people from the MBIE policy teams were invited to the MBIE session and 50 or so attended – this was apparently a very strong turnout. They liked the Nosek Pyramid approach for the recommendations, and in terms of the presentation that was made (MBIE presentation.pptx - Google Slides), and in a context of fiscal restraint, they also liked the “low hanging fruit’. They were keen on the Landscape Report being more actively disseminated. There was comment on the Australian Research Data Commons move towards supporting domain-based work (as opposed to more work around centralised support of infrastructure/training that is cross-disciplinary in nature). There was discussion of the MBIE roles in policy and in infrastructure (the top and bottom of the pyramid), but it was pointed out that they could/should contribute to the framework for filling the gap between policy and the coalface – and to training and skills uplift that is needed to support better data management and FAIR/CARE principles.

Any further feedback is welcome.

Research Data Culture Conversation #

After developing a baseline of Research Data in Aotearoa in 2023 with the help of Australian RDCC colleagues, we attempted to repeat the exercise in order to determine a growth rate of research data. These results were shared at eResearch NZ in February 2024, indicating a growth rate of about ~25% indicating there was likely 56 PB of research data holdings as of December 2023. This is in line with our Australian colleagues who have measured ~30 % growth rates over the last 5 years or so. Though it's important to note with fewer institutes contributing, the margin of error will be large. Slides from the BoF session, held with Australian RDCC colleagues are available to view: Making research data count.

A second and related BoF was also held at eResearch NZ, Unravelling the data lifecycle. Based on recent work carried out in Australia, as part of the Institutional Underpinnings program of the ARDC. Depicted here as “unravelling” the life cycle diagram, the challenges exposed relate to the nature of the decisions that drive data curation, retention, and disposal in each life cycle stage. Both sessions had good attendance and some lively debate over the idea of deleting research data. Related to this, there is a proposed RDA working group - Active disposal of research data: what you need to know and do before you push delete - at the upcoming RDA virtual plenary 22.

Next steps for the RDCC in both Australia and New Zealand are currently being decided based on the feedback received from the broader community, with Australia currently in talks with CAUDIT and AeRO. There has also been momentum around understanding the role of data stewardship in the context of Institutional Research data holdings.

From the New Zealand side, Claire Rye and Nick Jones from NeSI have been leading the kōrero and would be keen to engage further with interested members of the data community. Please reach out to claire.rye@nesi.org.nz.

Draft Projects #

Review of Research Data Management Planning #

This project will develop recommendations on research data management planning and subsequent implementation. We understand the importance of good research data management planning but are also aware of issues which affect the (non-)delivery or execution of these plans. Tentative working group: Michelle Blake, Richard Hartshorn, Nick Jones, Claire Rye (NeSI), Grace Walker.

Possible future projects have been identified:

Thoughts and suggestions on possible additional projects are very welcome.

Science System Advisory Group open for submissions #

MBIE has announced two advisory groups working in parallel to review the academic and research system in Aotearoa New Zealand. The Science System Advisory Group https://www.ssag.org.nz is currently receiving submissions for the first phase of its review with a deadline of 17 May and a 4 page limit. An initial report back due mid-year and a final report due before the end of this year. CoDiR will be making a submission and encourages submissions while also maintaining a keen interest in hearing from anyone with a view to share. Please contact CoDiR Chair Richard Hartshorn richard.hartshorn@canterbury.ac.nz.

CODATA Unions Forum #

CODATA is looking to establish, in collaboration with the International Science Council, a forum for data-related issues in which scientific unions and related bodies can engage directly to understand their activities and concerns, and identify opportunities for greater collaboration and joint projects. If you have interest in this idea, and particularly if you are connected with an international scientific union, please connect with Richard Hartshorn richard.hartshorn@canterbury.ac.nz.

eResearch NZ 2024 Conference Report and 2025 Conference #

This year marked eResearch NZ’s 15th, and a return to Wellington. The meeting held much of interest for those involved in Research Data. To whet appetites one inspirational talk came from Professor Carole Goble. Carole has a long history of leadership in the UK eScience community, and has been a key driver of data workflows, skills training, and sensitive data capabilities. Carole described the impressive work underway within the biological and life sciences to federate and integrate data and data platforms, along with a compelling view of the potential impact from an emerging protocol called RO Crate. Watch Carole’s keynote! A brief report on the meeting can be found at REANNZ, the host of this year’s event: eResearch 2024 hit the spot for delegates :: REANNZ

The eResearch conference series is well-aligned with many of the interests of CoDiR Forum members, and we believe that this may represent a good opportunity for us to get together. We hope to involve more CoDiR people/activities in this series of meetings. You can find more information on this conference at https://www.eresearchnz.co.nz.

For those interested in the scope of the meeting, the abstract book for the 2024 conference can be found here: eResearch-Abstract-handbook-eadaf72337221f70.pdf and the presentations are available on the eResearch NZ figshare https://eresearchnz.figshare.com/.

NeSI is hosting next year - look out for news on the 2025 edition, you can sign up to receive updates here !

International Data Week 2025 #

International Data Week is a key conference in the area, has important threads in the areas of Indigenous Data Sovereignty, data infrastructure, Open Science, and FAIR Data.

The 2025 edition will be held in Brisbane from October 13-16: International Data Week 2025, Brisbane, Australia - CODATA, The Committee on Data for Science and Technology.

This represents a real opportunity for data-oriented people to connect internationally. Such conferences do not often come to the region and it would be good to have a strong showing from NZ.

WDS News #

Claire Rye is the ECR rep on the WDS Scientific committee. Claire is highlighting the upcoming webinar series and also issuing a general invite to join the WDS early career researcher network, who are keen to recruit members from this side of the world.

CODATA WorldFAIR Reports #

WorldFAIR — the flagship project of the CODATA Decadal Programme, “Making Data Work for Cross-Domain Grand Challenges” — has had a busy start to 2024. In the first three months of the year, we have published eight new reports: for agricultural biodiversity, chemistry, geochemistry, nanomaterials, and social studies research. All these outputs are freely available online.

Agricultural biodiversity

  • WorldFAIR (D10.2) Agricultural Biodiversity Standards, Best Practices and Guidelines Recommendations. This new report presents the results from six pilot studies adopting standards and recommendations from our discovery phase.
    More: https://doi.org/10.5281/zenodo.10666593.
    Alongside this report is a Tutorial that facilitates the standardisation of any spreadsheet containing plant-pollinator interaction data, and assists in sharing the final product in the REBIPP or other platforms. More: https://doi.org/10.5281/zenodo.10688865
  • WorldFAIR (D10.3) Agricultural biodiversity FAIR data assessment rubrics. Introduces a set of FAIR assessment tools tailored to the plant-pollinator interactions domain, to help researchers and institutions evaluate adherence to the FAIR principles.
    More: https://doi.org/10.5281/zenodo.10719265

Chemistry

  • WorldFAIR (D3.2) Training Package: FAIR Chemistry Cookbook. This “Cookbook” trains various user groups in the chemical sciences in the FAIR principles and machine-readable chemical data. It serves as a toolbox of interactive recipes for implementing FAIR at various levels and for various users. More: https://doi.org/10.5281/zenodo.10711950
  • WorldFAIR (D3.3) Utility services for Chemistry Standards. Criteria for web-based services that organisations can implement based on preferred technologies (e.g., toolkits, programming languages). The services are intended to confirm chemical identity and provide feedback on the machine-readability of chemical data and metadata representations based on IUPAC standard rule sets and community best practices, to support a range of stakeholders engaging in chemical data exchange online.
    More: https://doi.org/10.5281/zenodo.10514901

Geochemistry

  • WorldFAIR (D5.2) Geochemistry Methodology and Outreach. Advocating the utility and significance of FAIR Implementation Profiles (FIPs) for the geochemistry community, and presenting a set of policy and organisational recommendations.
    More: https://doi.org/10.5281/zenodo.10406332
  • WorldFAIR (D5.3) Guidelines for implementing Geochemistry FIPs. We guide the geochemistry data infrastructure community towards convergence by identifying FAIR Enabling Resources (FERs) in current use by the geosciences community. We propose creating a reference FIP or catalogue of FERs to promote interoperability and prevent duplication of efforts. More: https://doi.org/10.5281/zenodo.10712808

Nanomaterials

  • WorldFAIR (D4.2) FAIRification of nanoinformatics tools and models
    recommendations
    . Targeted towards nanoinformatics model developers, this report presents a set of recommendations and prototypes for FAIRification of nanoinformatics tools and models. The focus is on tools and software primarily, and also FAIRIfication of the underpinning (and resulting) datasets. More: https://doi.org/10.5281/zenodo.10629631

Social studies

  • WorldFAIR (D6.3) Pilot Testing Harmonisation Workflows. We test the use of standardised workflows based on registry services available at the Australian Data Archive (ADA) and Sikt through their respective Colectica registries. The workflow steps are piloted with the International Social Survey Program (ISSP), to evaluate the Cross-Cultural Survey Harmonisation workflow as a suitable process for machine-to-machine based survey harmonisation. More: https://doi.org/10.5281/zenodo.10724744

For more on all things WorldFAIR, visit us online at http://worldfair-project.eu.

WorldFAIR is funded by the EC HORIZON-WIDERA-2021-ERA-01-41 Coordination and Support Action under Grant Agreement No. 101058393.

ICTD 2024 Conference #

The 13th International Conference on Information & Communication Technologies and Development (ICTD 2024), to be hosted in Nairobi, Kenya from December 9-11, 2024, invites you to submit research papers (http://www.ictd.org).

About: ICTD 2024 provides an international forum for scholarly researchers to explore the role of information and communication technologies (ICTs) in social, political, and economic development. The ICTD conferences have been taking place approximately every 18 months since 2006. The conference reflects the multidisciplinary and multisectoral nature of ICTD work, with representation from a broad range of areas (anthropology, communication, computer science, design, economics, electrical engineering, geography, global health, information science, political science, sociology, and many others), as well as participation from academia, industry, civil society, and government.

Submission Guidelines: Please refer to the instructions on the conference web site – http://www.ictd.org.

Accepted papers will be included in the ACM Digital Library.

Important Dates

  • 15 July 2024: Deadline for submission of Full Papers
  • 15 August 2024: Notification of acceptance for Full Papers
  • 15 September 2024: Camera-ready deadline for Full Papers

All submissions are due 11:59 pm AoE (Anywhere on Earth).

CODATA Working Group on Data Ethics #

The CODATA Working Group on Data Ethics (now a Task Group), has produced three policy briefings on important topics in relation to data ethics. At least two more are in the pipeline.

The policy briefings respond to the UNESCO Recommendation on Open Science. They start from the premise that the Recommendation, in its statements of values and principles, and its other argumentation, is a document with a significant ethical orientation. The policy briefings, therefore, seek to augment the Recommendation and add further considerations and policy guidance for ethical issues in relation to data.

The first three Policy Briefs are available for comment. We will greatly appreciate feedback by 25/04/2024:

Data Ethics and Research Integrity https://doi.org/10.5281/zenodo.10933361

Feedback is welcomed via the form https://forms.gle/SmkhP9XzunRLysGL6 or directly to the authors: Leo Lahti leo.lahti@utu.fi, Joy Jang joyjang@umich.edu, Hu Lianglin hull@cnic.cn, Dirk Hommrich, hommrich@kulturforschung-hd.de, Sun Kun Oh sunkun.oh@gmail.com

Data Ethics and Privacy https://doi.org/10.5281/zenodo.10933546

Feedback is welcomed via the form https://forms.gle/SmkhP9XzunRLysGL6 or directly to the authors: Ashlin Lee Ashlin.Lee@csiro.au, Masanori Arita arita@nig.ac.jp and Steve McEachern steven.mceachern@anu.edu.au

Data Ethics and Structural Inequities in Science https://doi.org/10.5281/zenodo.10933602

Feedback is welcomed via the form https://forms.gle/SmkhP9XzunRLysGL6 or directly to the authors: Suchith Anand Suchith.Anand@nottingham.ac.uk, Louise Bezuidenhout, l.m.bezuidenhout@cwts.leidenuniv.nl, Andrew Cox a.m.cox@sheffield.ac.uk, Johannes John- Langba, JohnLangbaJ@ukzn.ac.za, Sabina Leonelli, S.Leonelli@exeter.ac.uk

WorldFAIR Webinar Series #

April 22 @ 1:00 pm – 2:00 pm (UTC)

Register here

Speakers:

  • Evan Bolton: Protocol Services: Standardized programmatic access to chemical information
  • Stuart Chalk: Training Cookbook: Digital recipes for managing chemical data
  • Iseult Lynch: The Case Study on Nanomaterials

More about the Case Studies:

Chemistry

The WP3 (FAIR Chemistry) aims to align chemistry data standards with the FAIR data principles through:

  • Development of guidelines, tools and validation services that enable scientists to share and store data in a FAIR manner;
  • Addressing gaps in standards that currently restrain chemistry in both academic and industrial areas, in particular taking advantage of developments in AI/ML;
  • Engaging critical stakeholders in the adoption of standards and best practices to significantly increase the amount of chemical data available for all scientific disciplines.

Chemical substances touch on all areas of laboratory science and chemistry underlies many critical worldwide issues, including climate, health, food availability and sustainable development. Increased reporting of machine-readable chemical data will support active research in chemistry and related sciences worldwide, and will be essential to the development of the interdisciplinary science critical to address the UN Sustainable Development Goals and UNESCO’s priorities around Open Science. IUPAC is the world authority on chemical nomenclature, terminology, and standardized methods of measurement, and is engaging in a concerted effort through collaboration with the broader chemistry and data science communities to translate a range of assets and activities into the digital domain. Aligning standards development and implementation with the FAIR data principles will facilitate development of guidelines, tools and validation services that support scientists to share and store data in a FAIR manner and support the ability to compile and interpret data across scientific disciplines.

Nanomaterials

The mapping of existing initiatives to increase the FAIRness of both nanomaterials and mixture toxicity datasets and computational approaches for toxicity and mixture assessment is a critical step towards identifying both the domain- specific features and the general features needed to maximise data and model FAIRness.

Building on this mapping, and the development of a FIP, the case study will foster development and piloting of interoperability standards and guidelines for increasing FAIRness in the interlinked scientific disciplines (chemical toxicity, nanomaterials toxicity and characterisation, risk assessment, advanced materials, environmental science), and across the different domains.

This case study will enable the further adoption of the FAIR principles by the international nanomaterial community and encourage greater alignment with neighbouring disciplines and communities.

It builds on the partners’ successful collaboration in NanoCommons (a research infrastructure for nanoinformatics and FAIR nanomaterials data) and their leadership of the IUPAC InChI Trust efforts to develop a standard extension of the InChI for nanomaterials.

It will test the pilot operationalisation of the FAIR principles; run conference sessions and workshops with stakeholders (including the InChI-for-nano domain experts, and international ‘nano’ database managers and their users) to apply, refine, implement, improve the metrics for FAIR nanosafety datasets; and develop an inventory of FAIR nanoinformatics models and their domains of applicability, underpinning datasets and APIs to support interoperability, including guidelines to further improve the interoperability of nanoinformatics models.

The results will include complete human- and machine-readable nanomaterials data provenance trails that can be implemented in a straightforward way using the distributed FAIRification approach.

CoDiR 2024 Committee Membership #

  • Hamza Ajmal, Livestock Improvement Corporation, Market Analyst
  • Maryam Alavi, Plant and Food Research, Data Science
  • Michelle Blake, University of Waikato/Council of New Zealand University Libraries/Open Access Australasia Executive Board/International Association of University Libraries Executive Board, Library
  • Richard Hartshorn, University of Canterbury/CODATA Executive Committee/IUPAC Secretary General, Chemistry
  • Nick Jones, NeSI/RDA Organisational Advisory Board, Research Software & Data - Skills & Infrastructure
  • Tahu Kukutai, University of Waikato/Ngā Pae o te Māramatanga/Global Indigenous Data Alliance, Demography
  • Barry Milne, University of Auckland/COMPASS, Social Sciences
  • Tom Saunders, University of Auckland, eResearch Engagement
  • James Savage, Southern Institute of Technology | Te Pūkenga, Research Coordinator
  • Grace Walker, University of Otago/Ngāi Tahu Research Centre/Think Analytic Ltd, Psychology/Data Science

International contact people:

  • CODATA: Richard Hartshorn
  • RDA: Nick Jones
  • WDS: Kristin Stock
  • WDS ECR Network: Claire Rye