to Open-Science

Introduction

Creative Commons “Attribution-Share Alike” license icon
Benoît Seignovert
2025-01-14
Science Research and Communication
M2EMJM/M2STPE-EPS | XMS3GU030

Open access

Open source

HAL

Creative commons

Open-Science?

🧐

Science discoveries

  • Philosophical essays
  • Knowledge distributed among followers
  • Claims (rather than proofs)

during the Antiquity (and before)

Ptolemaic cosmology

SHEILA TERRY/SCIENCE PHOTO LIBRARY/Getty Images
  • Scientific proofs
  • Coded papers

during the Renaissance

Vitruvian Man

L. Da Vinci/Gallerie dell'Accademia

Science discoveries

Anagram and cypher

SMAISMRMILMEPOETALEUMIBUNENUGTTAUIRAS

Galileo

Salve, umbistineum geminatum Martia proles

Kepler

Altissimum planetam tergeminum observavi

Galileo

Hail, twinned offspring of Mars

I observed the highest planet of triplets

Science discoveries

Discovering

calculus

⚡️

Newton
Leibniz

Said that he wrote about calculus in the 1660s and 1670s, but did not publish until 1693

Published Nova Methodus pro Maximis et Minimis, a treatise on calculus, in 1684

Science discoveries

1660 | Royal Society

1667 | French Academy of sciences

1793 | 70+ scientific organization

Peer-reviews

1665 | 1st scientific journal

1700 | 30+ journals

1790 | 1050+ journals 

Public publications

Philosophical Transactions of the Royal Society

"Take nobody's word for it"

Science practices

Mertonian norms

1942

  1. Communism: all scientists should have common ownership of scientific goods (intellectual property), to promote collective collaboration; secrecy is the opposite of this norm
  2. Universalism: scientific validity is independent of the sociopolitical status/personal attributes of its participants
  3. Disinterestedness: scientific institutions act for the benefit of a common scientific enterprise, rather than for specific outcomes or the resulting personal gain of individuals within them
  4. Organized skepticism: scientific claims should be exposed to critical scrutiny before being accepted: both in methodology and institutional codes of conduct

R. K. Merton

Mitroff counter-norms

  1. Communism is countered by Secrecy: Scientists protect their newest findings to ensure priority in publishing, patenting, or applications
  2. Universalism is countered by Particularism: Scientists assess new knowledge and its applications based on the reputation and past productivity of the individual or research group
  3. Disinterestedness is countered by Self-interestedness: Scientists compete with others in the same field for funding and recognition of their achievements
  4. Organized skepticism is countered by Organized dogmatism: Scientists invest their careers in promoting their own most important findings, theories, or innovations

1974

Open-Science

1985

Participation in science is theoretically open, especially to those who are selected as peers, gatekeepers, and other agents of social control in science.

 

The imposition of such selection, however, serves to close science to many, at least to deliberations over fundamental matters of quality of knowledge claims and professional comportment.

 

In the end, openness is an interest-bearing idea; it cannot be settled with recourse to facts or logic. It is a matter for political debate, not scientific judgment alone.

Making science more accessible, inclusive and equitable for the benefit of all.

image/svg+xml

👍 Advantages

Open access publication of research reports and data allows for rigorous peer-review

 

Publicly funded science will be publicly available

 

Open science will make science more reproducible and transparent

 

Open science has more impact

 

Open Science can provide learning opportunities

 

Open science will help answer uniquely complex questions

Potential misuse

 

The public may misunderstand science data

 

Low-quality science

 

Entrapment by platform capitalism

 

WEIRD-focus (Western, Educated, Industrialized, Rich and Democratic)

👎 Disadvantages

The idea of open science is to make research results as accessible as possible and as closed as necessary, given that most of the research involved is supported by public funds.

🇫🇷 Open-Science in France

4 axes | 12 recommendations

Generalizing open access to publications

1) Obligation to publish in open access all articles and books resulting from publicly funded calls for proposals

 

2) Support open access economic publishing models that do not require the payment of articles or books processing charges (diamond model)

 

3) Encourage multilingualism and the circulation of scientific knowledge by translating publications by French researchers

Structuring, sharing and

opening up research data

4) Obligation to disseminate publicly funded research data

 

5) Create Recherche Data Gouv, the federated national platform for research data

 

6) Promote widespread adoption of data policies that cover the whole lifecycle of research data, to ensure that they are Findable, Accessible, Interoperable and Reusable (FAIR)

Opening up and promoting

source code produced by research

7) Recognize and support the dissemination under an open source license of software produced by publicly funded research programs

 

8) Highlight the production of source code from higher education, research and innovation

 

9) Define and promote an open source software policy

Transforming practices to make

open science the default principle

10) Develop and value open science skills throughout the educational and career pathways of students and research staff

 

11) Value open science and the diversity of scientific productions in the assessment of researchers, of projects and of universities and research performing organizations

 

12) Triple the budget for open science through the National Fund for Open Science and the Investments for the Future Program

The FAIR principals

Findable

Accessible

Interoperable

Ruseable

Data should be attributed a persistent identifier (e.g. Digital Object Identifier = DOI, ISBN, SWHID, …) to ensure stable access to the resource.


Data should be described using scientific and document metadata.


Data, or at least their metadata, should be indexed or recorded in a research tool, for example via submission to a repository (Zenodo, Recherche Data Gouv, Software Heritage, …) or referencing in a data catalogue.

Ensure the data are Findable for humans and machines via metadata indexing.

Institutional archive

Local archive

Disciplinary registry

Generalist registry

🔭 Indexation portals

🏷 Persistent identifiers

Unique and stable reference for an object or a digital entity (dataset, article, author…)

Online, using a standard, free and open protocol such as https.

 

By authentication for data not in open access.

 

The metadata should remain accessible even if the data are temporarily inaccessible or if there is restricted access to the data.

Allow Access to the data and metadata

Open-Access

2002

An old tradition and a new technology have converged to make possible an unprecedented public good. The old tradition is the willingness of scientists and scholars to publish the fruits of their research in scholarly journals without payment, for the sake of inquiry and knowledge. The new technology is the internet. The public good they make possible is the world-wide electronic distribution of the peer-reviewed journal literature and completely free and unrestricted access to it by all scientists, scholars, teachers, students, and other curious minds.

💶 Free (Gratis)

🔓 Open

👉 Call for the use of digital tools such as open archives and open access journals,

free of charge for the reader 📚. The cost is covered by the scientific institutions 🏛️.

💸 Article Processing Charge / APC

💎 Diamond open-access journals

2 different models to get be in Open-Access

Community driven journals

Usually founded by academic or non-profit organizations (~ 200€ per article)

Lower number of articles per-year

For-profit journals (Nature, Science, Elsevier, Wiley, …)

Authors (or their institution) will cover the price for publication

 Couperin agreement (132 M€ between 2024-2027) to pool the publication cost (~ 3,000 € per article)

None in planetary-science, yet…

📖 Preprint registries

HAL

Astrophysics

Biology

Generic

= last version before publication

The data should be described at the beginning of their lifecycle using controlled vocabulary.

 

The metadata should refer to other data that can be linked together.

 

The file formats used should be open and documented to ensure exploitable and persistent data using different tools.

Make the data Interoperable to ensure they can be used regardless of
the computing environment used by humans or machines

How scientist keep the

rights to their discovery

while also allowing for

external use?

🔬

📝 Open-License

The license defines the elements of

paternity and the conditions of use (copyright).

An open license defines the conditions for the distribution and reuse of content (copyleft).

Open-Licenses

Creative Commons

License Ouverte (Etalab)

Open-Source licenses:

BSD, MIT, Apache, GPL, LPGL, AGPL, MPL, EPL…

Creative Commons licenses

No attribution / no copyright

Copy, modification et redistribution.

Author attribution

Copy, modification and redistribution.

Author attribution

Copy, modification and redistribution under the same license.

Author attribution

Copy and redistribution possible but no modification.

No commercial use

The metadata should have several useful attributes to facilitate the comprehension and reuse of the data.


A license for reuse should be associated with the data.

 

The description of the data should indicate their provenance.


The structure of the data should be in line with the standards of the scientific community to facilitate their analysis.

Allow the data to be reused for future research

The 4R in Open Science

Repeatability

Replicability

Reusability

Reproducibility

Same Lab / Team

Same Algorithm / Method

Same Model / Code

Same Data / Metadata

Same Environment / Setup

Different Lab / Team

Same Algorithm / Method

Same Model / Code

Same Data / Metadata

Different Environment / Setup

Different Lab / Team

Same Algorithm / Method

Adapted Model / Code

Different Data / Metadata

Different Environment / Setup

Different Lab / Team

Adapted Algorithm / Method

Adapted Model / Code

Different Data / Metadata

Different Environment / Setup

📊 Open-data

👀 Good data

🔭 Example 1 👀

🛰️ Searching for a recent paper… 🔍

🙁

Guerri et al. 2018

🙄

Request a copy to the library

🤬

No available anywhere…

🪐 Example 2 👀

🎊 My 1st paper as lead author 🎉

👍

👎

Seignovert et al. 2017

🤕 No data…

🙄 No sources…

😳

Seignovert et al. 2017

😬

🖨️ Preprint available 🙌

arxiv: 1704.00842

🔒 Editor version

🔓 Preprint (before publication)

☁️ Example 3 👀

📸 My latest paper (as co-author) ✨

Yahn et al. 2025

👍

Yahn et al. 2025

👍

Dataset

👍

Source-code

👍

Open Science

Publications

Data

Metadata

Methods

Source-code

Open Science

Sovereignty

Integrity

Transparency Reuse

Accessibility

Equity

Any questions ?

🧑‍🏫

Introduction to Open Science

By Benoît Seignovert

Introduction to Open Science

Science Research and Communication Lecture | M2EMJM/M2STPE-EPS

  • 41