Benoît Seignovert
🌍 Planetary data scientist 🌔
Benoît Seignovert
2025-01-14
Science Research and Communication
M2EMJM/M2STPE-EPS | XMS3GU030
Open access
Open source
HAL
Creative commons
SMAISMRMILMEPOETALEUMIBUNENUGTTAUIRAS
Salve, umbistineum geminatum Martia proles
Altissimum planetam tergeminum observavi
Hail, twinned offspring of Mars
I observed the highest planet of triplets
⚡️
Leibniz
Said that he wrote about calculus in the 1660s and 1670s, but did not publish until 1693
Published Nova Methodus pro Maximis et Minimis, a treatise on calculus, in 1684
1660 | Royal Society
1667 | French Academy of sciences
1793 | 70+ scientific organization
1665 | 1st scientific journal
1700 | 30+ journals
1790 | 1050+ journals
Philosophical Transactions of the Royal Society
"Take nobody's word for it"
1942
R. K. Merton
1974
1985
Participation in science is theoretically open, especially to those who are selected as peers, gatekeepers, and other agents of social control in science.
The imposition of such selection, however, serves to close science to many, at least to deliberations over fundamental matters of quality of knowledge claims and professional comportment.
In the end, openness is an interest-bearing idea; it cannot be settled with recourse to facts or logic. It is a matter for political debate, not scientific judgment alone.
Open access publication of research reports and data allows for rigorous peer-review
Publicly funded science will be publicly available
Open science will make science more reproducible and transparent
Open science has more impact
Open Science can provide learning opportunities
Open science will help answer uniquely complex questions
Potential misuse
The public may misunderstand science data
Low-quality science
Entrapment by platform capitalism
WEIRD-focus (Western, Educated, Industrialized, Rich and Democratic)
4 axes | 12 recommendations
1) Obligation to publish in open access all articles and books resulting from publicly funded calls for proposals
2) Support open access economic publishing models that do not require the payment of articles or books processing charges (diamond model)
3) Encourage multilingualism and the circulation of scientific knowledge by translating publications by French researchers
4) Obligation to disseminate publicly funded research data
5) Create Recherche Data Gouv, the federated national platform for research data
6) Promote widespread adoption of data policies that cover the whole lifecycle of research data, to ensure that they are Findable, Accessible, Interoperable and Reusable (FAIR)
7) Recognize and support the dissemination under an open source license of software produced by publicly funded research programs
8) Highlight the production of source code from higher education, research and innovation
9) Define and promote an open source software policy
10) Develop and value open science skills throughout the educational and career pathways of students and research staff
11) Value open science and the diversity of scientific productions in the assessment of researchers, of projects and of universities and research performing organizations
12) Triple the budget for open science through the National Fund for Open Science and the Investments for the Future Program
Findable
Accessible
Interoperable
Ruseable
Data should be attributed a persistent identifier (e.g. Digital Object Identifier = DOI, ISBN, SWHID, …) to ensure stable access to the resource.
Data should be described using scientific and document metadata.
Data, or at least their metadata, should be indexed or recorded in a research tool, for example via submission to a repository (Zenodo, Recherche Data Gouv, Software Heritage, …) or referencing in a data catalogue.
Unique and stable reference for an object or a digital entity (dataset, article, author…)
Online, using a standard, free and open protocol such as https.
By authentication for data not in open access.
The metadata should remain accessible even if the data are temporarily inaccessible or if there is restricted access to the data.
2002
An old tradition and a new technology have converged to make possible an unprecedented public good. The old tradition is the willingness of scientists and scholars to publish the fruits of their research in scholarly journals without payment, for the sake of inquiry and knowledge. The new technology is the internet. The public good they make possible is the world-wide electronic distribution of the peer-reviewed journal literature and completely free and unrestricted access to it by all scientists, scholars, teachers, students, and other curious minds.
👉 Call for the use of digital tools such as open archives and open access journals,
free of charge for the reader 📚. The cost is covered by the scientific institutions 🏛️.
2 different models to get be in Open-Access
Community driven journals
Usually founded by academic or non-profit organizations (~ 200€ per article)
Lower number of articles per-year
For-profit journals (Nature, Science, Elsevier, Wiley, …)
Authors (or their institution) will cover the price for publication
Couperin agreement (132 M€ between 2024-2027) to pool the publication cost (~ 3,000 € per article)
None in planetary-science, yet…
HAL
Astrophysics
Biology
Generic
= last version before publication
The data should be described at the beginning of their lifecycle using controlled vocabulary.
The metadata should refer to other data that can be linked together.
The file formats used should be open and documented to ensure exploitable and persistent data using different tools.
The license defines the elements of
paternity and the conditions of use (copyright).
An open license defines the conditions for the distribution and reuse of content (copyleft).
Creative Commons
License Ouverte (Etalab)
Open-Source licenses:
BSD, MIT, Apache, GPL, LPGL, AGPL, MPL, EPL…
No attribution / no copyright
Copy, modification et redistribution.
Author attribution
Copy, modification and redistribution.
Author attribution
Copy, modification and redistribution under the same license.
Author attribution
Copy and redistribution possible but no modification.
No commercial use
The metadata should have several useful attributes to facilitate the comprehension and reuse of the data.
A license for reuse should be associated with the data.
The description of the data should indicate their provenance.
The structure of the data should be in line with the standards of the scientific community to facilitate their analysis.
Same Lab / Team
Same Algorithm / Method
Same Model / Code
Same Data / Metadata
Same Environment / Setup
Different Lab / Team
Same Algorithm / Method
Same Model / Code
Same Data / Metadata
Different Environment / Setup
Different Lab / Team
Same Algorithm / Method
Adapted Model / Code
Different Data / Metadata
Different Environment / Setup
Different Lab / Team
Adapted Algorithm / Method
Adapted Model / Code
Different Data / Metadata
Different Environment / Setup
🙁
🙄
🤬
👍
👎
🤕 No data…
🙄 No sources…
😳
arxiv: 1704.00842
🔒 Editor version
🔓 Preprint (before publication)
👍
👍
👍
👍
By Benoît Seignovert
Science Research and Communication Lecture | M2EMJM/M2STPE-EPS