The ‘Framework for Open and Reproducible Research Teaching’ (FORRT) community presents a crowdsourced glossary of open scholarship terms to facilitate education and effective communication between experts and newcomers. Glossary available here (https://forrt.org/glossary/), and a version of the Nature article is available to read only here.
Glossary of terms related to Open Research (selected from the Foster Glossary and Open Science Training Handbook)
- Accepted author manuscript - the version of a manuscript that has been accepted by a publisher for publication.
- Altmetrics are alternative ways of recording and measuring the use and impact of scholarship. Rather than solely counting the number of times a work is cited in scholarly literature, alternative metrics also measure and analyse social media (e.g., Facebook, Twitter, blogs, wikis, etc.), document downloads, links to publishing and unpublished research, and other uses of research literature, in order to provide a more comprehensive measurement of reach and impact.
- Article-level metrics - all types of article-level metrics including download and usage statistics, citations, and article-level altmetrics
- Article Processing Charge (APC) - a fee charged to the author, creator, or institution to cover the cost of an article, rather than charging the potential reader of the article. APCs may apply to both commercial and Open Access publications. APCs are sometimes charged to authors in order to cover the cost of publishing and disseminating an article in an Open Access scholarly journal.
- Bibliodiversity is cultural diversity applied to the world of books. Echoing biodiversity, it refers to the critical diversity of products (books, scripts, eBooks, apps, and oral literature) made available to readers.
- Copyright - The aspect of Intellectual property that gives creators the right to permit (or not permit) what happens to their creations, as opposed to trademark rights or moral rights.
- Creative Commons - a suite of licences that set out the rights of authors and users, providing alternatives to the standard copyright. CC licences are widely used, simple to state, machine readable and have been created by legal experts. There are a variety of CC licences, each of which use one or more clauses. Find out more here. Also check out our 10 Things You Should Know About Open Licenses.
- Creative work - an original, identifiable piece of content, such as an academic paper, a diagram, a photograph, or a video clip. Owners of creative works have rights, such as copyright, that they might reserve to keep control of the content, or relinquish to allow others to share and reuse that content
- Data in the sense used here are all digitally available objects (simple or complex) that emerge or are the result of the research process.
- Data Mining - an analytic process designed to explore data in search of consistent patterns or systematic relationships between variables, transforming data into information for future use.
- Digital Object Identifier (DOI) - a unique text string that is used to identify digital objects such as journal articles, data sets or open source software releases. A DOI is one type of Persistent Identifier (PID).
- Documentation - a documentation is detailed information as well as background and methodological approach about the data or code (e.g., description of the project, variables, and measuring instruments).
- Embargo period - a length of time imposed on a research output for users who have not paid for access, or do not have institutional access, before it is made freely available.
- FAIR Data (according to FORCE11 principles and published in Nature Scientific Data) are Findable, Accessible, Interoperable, and Re-usable, in order to facilitate knowledge discovery by assisting humans and machines in their discovery of, access to, integration and analysis of, task-appropriate scientific data and their associated algorithms and workflows.
- Gamification - the use of game design elements and game mechanics in non-game contexts, such as education where it can be used to bring extra engagement.
- GDPR (General Data Protection Regulation) seeks to create a harmonised data protection law framework across the EU. It aims to restitute the control of personal data to citizens, whilst imposing strict rules on those hosting and 'processing' these data, anywhere in the world. The Regulation also introduces rules relating to the free movement of personal data within and outside the EU.
- Green OA making a version (generally the author’s accepted manuscript) of the manuscript freely available in a repository
- Hybrid - a type of journal in which certain articles are made open access for typically a significantly higher price (relative to full OA journals), while others remain toll access.
- Impact Factor - a numerical measure that indicates the average number of citations to articles published over the previous two years in a journal. It is frequently used as a proxy for a journal's relative importance. Its transfer to the impact of individual articles published in a journal is considered to be problematic.
- Institutional Repository - an online database designed to collect the intellectual output of a particular institution or university, including digital collections such as electronic theses and dissertations (ETDs), pre-prints, or faculty scholarship, and presents associated metadata regarding these items
- Intellectual Property (IP) - a legal term that refers to creations of the mind. Examples of intellectual property include music, literature, and other artistic works; discoveries and inventions; and phrases, symbols, and designs.
Intellectual Property Rights (IPR) - the rights given to the owners of intellectual property. IPR is protected either automatically (eg copyright, design rights) or by registering or applying for it (eg trademarks, patents). Protecting your intellectual property makes it easier to take legal action against anyone who steals or copies it. IPR can be legally sold, assigned or licenced by the creator to other parties, or joint-owned. - Journal - a series of published research articles. Historically divided into volumes and issues.
- Legacy publisher - a publisher that historically has operated on a paywall-based business model
- License - a license allows a third party to perform certain actions with a work or data. The license informs about the usage rights of a resource (e.g. text, data, source code).
- Metadata provide a basic description of the data, often including authorship, dates, title, abstract, keywords, and license information. They serve first and foremost the findability of data (e.g. creator, time period, geographic location).
- Open Access (OA) refers to online, free of cost access to peer reviewed scientific content with free reusability regarding copyright restrictions. Open Access makes peer reviewed scholarly manuscripts freely available via the Internet, permitting any user to read, download, copy, distribute, print, search, or link to the full text of these articles, crawl them for indexing, pass them as data to software, or use them for any lawful purpose, without financial, legal or technical barriers other than those inseparable from gaining access to the internet itself. The only constraint on reproduction and distribution, and the only role for copyright in this domain, should be to give authors control over the integrity of their work and the right to be properly acknowledged and cited. May also refer to theses, books, book chapters, monographs and other content
- Open Access Journal - a journal that exclusively comprises open access articles.
- Open Access Publisher - a publisher that publishes all research articles as open access articles. Most legacy publishers have options to make journals at least partially open access
- Open Data are online, free of cost, accessible data that can be used, reused and distributed provided that the data source is attributed. Open data is data freely available on the public internet permitting any user to download, copy, analyse, re-process, pass them to software or use them for any other purpose without financial, legal, or technical barriers other than those inseparable from gaining access to the internet itself.
- Open Educational Resources (OER) - high quality, openly licensed, online educational materials for sharing, use, and reuse. They act as a mechanism for instructional innovation as networks of teachers and learners share best practices
- Open Evaluation - the development of a fair evaluation system or protocol for research proposals, based on transparency of the process and those involved.
- Open Lab Notebooks - a concept of writing about research on a regular basis, such that research notes and data are accumulated and published online as soon as they are obtained.
- Open Materials - sharing of research materials, for example, biological and geological samples, is another Open Science practice.
- Open Peer Review - an umbrella term for a number of overlapping ways that peer review models can be adapted in line with the aims of Open Science, including making reviewer and author identities open, publishing review reports and enabling greater participation in the peer review process.
- Open Science is the movement to make scientific research, data and dissemination accessible to all levels of an inquiring society. Interchangeable with Open Research.
- Open Source - availability of source code for a piece of software, along with an open source license permitting reuse, adaptation, and further distribution. Peer Review A process by which a research article is vetted by experts from the community before publication.
- Paywall - restriction via a financial barrier to research, often implemented by legacy publishers. Can be removed by personal or institutional subscription.
- Persistent Identifier (PID) - a persistent identifier (also PID) is a unique and stable denomination (reference) of a digital resource (e.g. research data) through allocation of a code that can be persistently and explicitly referenced on the internet
- Persistent/Preferred File Formats - non-proprietary formats that follow documented international standards, are commonly used by the research community, use standard character encoding (e.g. ASCII, UTF-8), and were compression, if used at all, is lossless.
- Post-print - a manuscript draft after it has been peer reviewed.
- Preprint - a manuscript draft that has not yet been subject to formal peer review, distributed to receive early feedback on research from peers.
- Preregistration - researchers have the option or are required to submit important information about their study (for example: research rationale, hypotheses, design and analytic strategy) to a public registry before beginning the study. Preregistration can help counter reporting bias.
- Publishing - to make a research output available to the public. Commonly refers to the release of works by publishers, irrespective of whether public access is granted or not
- README file - file where you document your research data. The documentation should be sufficient to enable other researchers to understand, replicate or reproduce the data or reuse them in any other way.
- Repeatability - the similarity between results of a study or experiment and independent results obtained with the same methods and under identical conditions (i.e., pertains to methods and analysis).
- Reporting Bias - reporting bias occurs when certain aspects of a study are systematically not reported transparently, creating wastage and redundancy through selective reporting or non-publishing.
- Repository is defined as the infrastructure and corresponding service that allows for the persistent, efficient and sustainable storage of digital objects (such as documents, data and code).
- Repository (article) - an archive to deposit manuscripts. These can be personal, institutional, on websites such as ResearchGate or Academia.edu, or subject-based such as arXiv.
- Repository (software) - a collection of files managed with version control software (e.g., bzr, hg, git, csv, svn, etc.). Can be hosted by third-party (e.g., github, bitbucket, sourceforge), by an institution, or self-hosted locally.
- Reproducible Research - reproducibility is a spectrum and instructors should choose the definition most used by their audience. Generally speaking, reproducible research makes it possible to obtain similar results of a study or experiment and independent results obtained with the same methods but under different conditions (i.e., pertains to results). Some break the definition into levels of reproducibility, including computationally reproducible (also called "reproducible"): where code and data can be analyzed in a similar manner as in the original research to achieve the same results, and empirically reproducible (also called “replicable”): where an independent researcher can repeat a study using the same methods but creating new data.
- Research Impact - involve academic, economic and societal aspects, or some combination of all three. Impact is the demonstrable contribution that research makes in shifting understanding and advancing scientific, method, theory and application across and within disciplines, and the broader role that this plays outside of the research system.
- Research Funder - an institute, corporation or government body that provides financial assistance for research.
- Scholarly Communication - the creation, transformation, dissemination, and preservation of knowledge related to teaching, research, and scholarly endeavors; the process of academics, scholars and researchers sharing and publishing their research findings so that they are available to the wider academic community.
- Sharing - the joint use of a resource or space. A fundamental aspect of collaborative research. As most research is digitally-authored & digitally-published, the resulting digital content is non-rivalrous and can be shared without any loss to the original creator.
- Subscription - a form of business model whereby a fee is paid in order to gain access to a product or service - in this case, the outputs of scholarly research. Trainer The moderator and instructor of a training, whose role is to ensure the training objectives are met, run the practice, and ensure no one is left out.
- Training is any organised activity that teaches, informs, or transfers skills or knowledge on specific useful competencies through active, engaged learning. conventionally named, standardised delivery method that is applied by a trainer and includes any number of the pedagogical tools necessary (i.e. motivation/demotivation, hands-on approaches, etc).
- Version Control is the management of changes to documents, computer programs, large websites, and other collections.