Archive for category data standards

DITA – A framework for scientific publishing?

Posted by peanutbutter in data standards, Journals Publishing on February 1, 2011

There are two industry recognised standards for XML based documentation. These are Docbook and DITA (Darwin Information Typing Architecture).

Docbook is the older of the two specifications and created specifically for technical documentation. DITA, is a younger specification which grew out of IBM, and is referred to as having its own architecture and was designed to provide structure to more than just a book. Both specifications are OASIS standards.

As with XML schemas, both specifications can be extended to include bespoke features. However, Docbook is based more on a book structure with Sections and subsections, where as DITA is built around topics that can be built up in any arrangement based on a document map. A DITA topic is open to specialisation itself, however, a topic has only three required elements

An id attribute
A title
A body

A topic also has numerous optional elements, utilising HTML syntax. e.g,

A topic can exist as a single XML file which can be composed into any arrangement for publication through the use of a document map. A DITA structure would present a more flexible architecture where the same “topic”, i.e a journal article section, such as an abstract, materials and methods, or results, could be included with ease more than one publication, correctly referenced. In this respect DITA is more like an object-oriented document schema, and can be more easily repurposed (in terms of structure) for any output format (i.e pdf, HTML). In the same respect, Docbook can be configured with some work to behave on a more topic by topic basis and DITA can support a book based methodology. They are after all both XML schemas and are equally extensible or open to specialisation.

As its a standard, whole ecosystems have emerged which makes use of the DITA architecture. For example, DITA for publishers provides libraries to convert DITA markup into HTML, PDF, EPUB, and Kindle rendering support. This allows content structures in DITA to be repurposed for different audiences or different devices with relative ease.

I have recently started using DITA as an architecture to represent content, primarily designed for books. However, with new demands appearing for different delivery mechanisms of the traditional textbook, such as Web delivery and ebooks, DITA is proving to be immensely powerful to deliver the same content through different mediums with relative ease and speed. In using it, it seems obvious that a DITA architecture would benefit the representation of content within a journal article, allowing references re-purposing and multiple format delivery. Maybe a topic for discussion through the Beyond the PDF forum.

In the end, it’s just XML, so I wont repeat the virtues of content markup through XML. However, for me its main advantage is the object oriented -like topic structure as a working architecture.

Using RDFa with DITA and DocBook (devx.com)
Dita Educational Use cases (docs.oasis-open.org)
Converting documents between a wiki and Word, XML, FrameMaker or other help formats (ffeathers.wordpress.com)
Future-proofing e-books with XML (teleread.com)
The PDF Landscape for DITA Content (tc.eserver.org)

beyondthepdf, BTPDF, Darwin Information Typing Architecture, Data Formats, DITA, Docbook, HTML, IBM, Markup Languages, Scientific journal, scientific publishing, XML, XML schema

Leave a comment

Attribution vs Citation: Do you know the difference?

Posted by peanutbutter in bioinformatics, data standards, Journals Publishing, life-science, ontology, open data, open science on July 10, 2009

This is a cross-posted , two-author item available both from this and Allyson’s blog.

Often the words “attribution” and “citation” are used interchangeably. However, in the context of ensuring your work gets the referencing it deserves when others make use of it, it is important that the differences between these two concepts are clear. This article outlines the differences between attribution and citation, and suggests that what most scientists are interested in is not attribution, which can be ensured via licensing restrictions, but instead citation, which is a much tougher nut to crack.

At ISMB last week, there were a number of conversations about the difference between attribution and citation. This topic was brought up again yesterday in a conversation between the two authors of this post, Frank and Allyson. It is an important distinction which is explored in this post.

First, some definitions for attribution and citation. These are not the only definitions possible, but for the purposes of this discussion, please keep these in mind.

Attribution: Acknowledgement of the use of someone else’s information, data, or other work. Crucially, while Wikipedia has a fairly straightforward definition of citation, it does NOT mention even common ways that attribution should be implemented (see Wikipedia attribution page).

Citation: When you publish a paper that makes use of someone else’s information (data, ontology, etc.), you include in that paper a reference to the work of that other person or group. Wikipedia states that it is a “reference to a published or unpublished source” whose prime purpose is of “intellectual honesty”.

Distinguishing between attribution and citation.
You can imagine that citation is a specific type of attribution, but attribution itself can be performed in any number of ways. For scientists, citation is much more useful to their careers as a result of the publish or perish environment.

So, what could attribution consist of? First, let’s take as an example the re-use of someone else’s ontology or specific sub-parts or classes of that ontology. Each class in an ontology is identified by a URI. Therefore, is importing the URI enough? With a URI is it clear where you got the class from? If it’s not enough, where do you put that reference or statement that you are re-using other classes: within the overall metadata of your own ontology? Alternatively, when attributing data is a reference to the originating paper or URL from where you downloaded the data enough? Where do you put that reference: within the metadata of your own document? As a citation? How much is enough attribution?

These questions cannot easily be answered.

A common-sense answer to the question of properly fulfilling requirements is to, at a minimum, first cite their information in your paper, and second include URL(s)/URI(s) in your metadata. But here we get to the crux of the matter: we’ve now stated that a useful way to ensure attribution is to cite the other person. But, if you think carefully, what’s more important for your impact assessments, and your work? It’s actually the citation itself. Sure, acknowledgement via extra referencing in the metadata of the person using your information is great, but what you really need is a citation in their work. If we aren’t careful, we will all make the easy mistake of conflating citation in papers with importing a licensed piece of information and how to mark its inclusion: the former is what we often are scored on and what we would really like, while the latter is the only thing a license enforces. Licensing with attribution requirements is not citation; you can make use of a licensed ontology, but this does not require you to cite it in a paper.

Attribution: the legal entity.

Important point: It’s easy to use a license such as the CC-BY, thinking that you’ll ensure citation, when in fact all you’re doing is ensuring attribution.

What are the implications of attribution? It can quickly get out-of-control and difficult to manage.
By requiring attribution in an ontology or data file, if someone imports information (such as a class from an ontology) into their own document, the new one must attribute the original. Continuing the ontology analogy, if there are 20-30 ontologies being used for a single project (which is not inconceivable in the coming years), there could be great difficulty in maintaining attribution for them all.

Important point: While licenses such as the CC-BY allow the attribution to be performed “in the manner specified by the author or licensor”, this could lead to 30 different licensors requiring potentially 30 different methods of attribution, and attribution stacking isn’t pretty.

Citation: the gentlemen’s club.

Can citation be assured? No. Well, maybe.
You can imagine citation as a gentlemen’s club, as propriety dictates that you should cite another’s work that you use, but there is no legal requirement to do so. Indeed, many believe that citation should not be enforced anyway. In contrast, attribution as required by licenses is a legal statement. However, let’s revisit the clause in CC-BY that states the author/licensor can specify the manner in which the attribution is given.

Important point: Could you use a license such as CC-BY, and state that the attribution must come in the form of, at a minimum, citation in the paper which describes the work being performed by the licensee?

Bottom line: which one is more important to you, as a scientist? Depends on the context.
This is difficult to answer. There aren’t very many guidelines available for us to analyse. The OBO Foundry does have a set of principles, the first of which states that “their [the ontology(ies) and their classes] original source is always credited and that after any external alterations, they must never be redistributed under the same name or with the same identifiers”. However, how this credit is attained is unclear, as described in various blog posts (Allyson, Frank, Melanie). As a result, the following conclusions came out of the OBO Foundry workshop this summer (Monday outcomes): it is “unclear if each ontology should develop their own bespoke license or use develop ‘CC-by’; how to give attribution? Generally use own judgment, here MIREOT mechanism can help when importing external terms into an ontology, giving class level attribution” (MIREOT web page, see also OWLED 2008 paper). Therefore, while they are aware of the problem, they don’t offer a consensus solution(s).

The flipside of this is that in order to use an ontology, you first have to write a paper and cite the classes you wish to import, then get on with the work. If you never get a paper and therefore a citation, is you ontology/data illegal? If you take the example of OBI, which imports several other ontologies and is an open community of developers, would a license restriction requiring citation actually prevent the work starting? This is probably a bit of a chicken-and-egg scenario, if it were ever to come a reality. In short, while there are some tempting possibilities, there doesn’t yet seem to be a useful solution.

In summary, it’s generally not attribution that people want (which can be licensed, even if you don’t like the layers of attribution that will require once you’re using multiple sources) but citation, which isn’t so easily licensed – yet. When deciding what sort of license to use (e.g. an open one like CC0 or an attribution-based one like CC-BY), you need to take into account expected usage. In some cases, for a leaf ontology, perhaps CC-BY is appropriate, as it isn’t intended to be imported by others, but you never know when your leaf will turn into something others import. Science Commons also believes that attribution is a very different beast, and shouldn’t be required when licensing data. They provided me with an answer to how to license ontologies recently that favored CC0.

So, if you really want citation and not attribution, consider an open license such as CC0 and make a gentlemanly (gentle-science-person-ly) request that if someone uses it AND publishes a paper on it, please cite it in the way you suggest. Alternatively, I’d be interested to hear if it would be possible to use an attribution-based license such as CC-BY and then require the attribution method be citation in a paper. Would this method work, and would it be polite? Your comments, please.

attribution, Creative Commons licenses, Knowledge Management, Knowledge Representation, License, Metadata, OBO Foundry, ontology, Wikipedia

5 Comments

The OBO foundry principles

Posted by peanutbutter in bioinformatics, conference, data standards, ontology, open data, open science, semantic web on June 7, 2009

This week, is a week long ontology building week, consisting of two days at the OBO Foundry workshop followed by 4 days at the OBI workshop, all hosted at the EBI. In advance of the meeting (even though I am writing this during the meeting) Duncan asked “how can the ontology development principles be improved“. Ally and Melanie responded commenting on each principle, and I would pretty much agree with every issue the ontology ladies raise. These principles should be used to guide ontology developers to build a consistent resource and which are used to “peer-review” the ontology. However, my concern is that there is no indication or recommended methodology in how these principles could be met, during the development process. This was my motivation for reviewing all the existing documented methodologies are assess there applicability (shameless plug), as I think it is important to remember that the members of the OBO Foundry are not the first people in the world to build ontologies and we should make use of known and documented expertise where possible instead of re-inventing the wheel. These are my take on the principles below. However, I would recommend reading Ally’s and Melanie’s first as I have tried not to repeat what they have already said. Although, as Duncan and the ontology ladies have independently arrived at mostly the same conclusions, there is a very real need to address or more explicitly state, these set of principles.

1.The ontology must be open and available to be used by all without any constraint other than (a) its origin must be acknowledged and (b) it is not to be altered and subsequently redistributed under the original name or with the same identifiers.

Licenses – This is always a touchy subject, within the life-sciences and IMHO largely due to a mis-understanding of what a license is actually for. Being open in the sciences is often mis-interpreted as “you can use it, but you have to attribute me”. This is actually not being open. The attribution aspect is actually a restriction. The principle of the OBO foundry is that there will be hundreds of separate and orthogonal ontologies that will all refer and reference each other. If every single ontology has to be attributed this will become a large overhead. In addition, if we do insist on attribution, how do we acknowledge the use? An official statement? Is importing the URI enough? This aspect of the principles really needs to be explicit and clearly stated. Making use of licenses that already exist may be a good starting point, rather than trying to define a bespoke OBO Foundry license. Two possibilities are Creative Commons – Attribution or CCO. The current OBO principles seem to merge these two licenses together. An explicit statement on licenses is really needed. Ally covers this in more detail on her post.

2. The ontology is in, or can be expressed in, a common shared syntax. This may be either the OBO syntax, extensions of this syntax, or OWL.

mm, this is confusing “expressed in a common shared syntax”, but you can use either OBO or OWL that would be two different syntax – no? Either the OBO foundry are in a shared syntax, or they are in OBO or OWL.

3. The ontologies possesses a unique identifier space within the OBO Foundry.

I would be good just to tighten this up a bit. A clear statement of the identification schema would be helpful

4. The ontology provider has procedures for identifying distinct successive versions.

This is a good statement and version of ontologies definitely need to be identified. As Ally mentions, we probably do not want to legislate which versioning system to use (svn, git etc). However maybe a recommendation of which to use and what constitutes a change may be helpful.

5. The ontology has a clearly specified and clearly delineated content.

How would you describe this to your users or developers? In ontology development this is traditionally called defining your scope. Your scope can be described by competency questions – questions your ontology should answer.

6.The ontologies include textual definitions for all terms.

A definition of a class in the ontology is its assertion in the hierarchy and all the logical restrictions, it can also include a natural language definition. I would re-word this to ” The classes in the ontology shoud have a natural language definition which reflection the logical definition of the class”.

7.The ontology uses relations which are unambiguously defined following the pattern of definitions laid down in the OBO Relation Ontology.

Same comment as Ally

8. The ontology is well documented.

Not really sure what this actually means or how to implement it. There are a set of naming recommendations within the OBO Foundry, is this what it is referring to? There are also metadata recommendation from OBI, is this the same thing?

9.The ontology has a plurality of independent users.

Why is this important as a principle for inclusion? Is listing on the OBO Foundry not an attempt to gain wider use? Are the computational users? is a user an individual, a lab, a project, community?

10.The ontology will be developed collaboratively with other OBO Foundry members.

Why? Is this really a guiding development policy? What does collaboratively mean? In terms of the license? Does branching in a versioning repository count as collaborative development? I would suggest that if we get the terms of the license explicit an the idea of the Foundry then this principle is probably not necessary to be stated.

These comments are more a mix questions for debate rather than any clear cut corrections.

bio-ontology, Creative Commons licenses, Knowledge Management, Knowledge Representation, Metadata, OBI, OBO Foundry, Ontologies, ontology, Peer review

11 Comments

HUPO PSI-PAR: standard format for protein affinity reagents

Posted by peanutbutter in data standards on April 7, 2009

: Image via Wikipedia

HUPO PSI-PAR: standard format for protein affinity reagents is now available for Public Comment on the PSI Web site for the next 30 days. The public comment period enables the wider community to provide feedback on a proposed standard before it is formally accepted, and thus is an important step in the standardisation process. This message is to encourage you to contribute to the standards development activity by commenting on the material that is available online. We invite both positive and negative comments. If negative comments are being made, these could be on the relevance, clarity, correctness, appropriateness, etc, of the proposal as a whole or of specific parts of the proposal. If you do not feel well placed to comment on this document, but know someone who may be, please consider forwarding this request. There is no requirement that people commenting should have had any prior contact with the PSI. If you have comments that you would like to make but would prefer not to make public, please email the PSI-Editor directly.

Antibody, data representation, Immune system, MIBBI, psi, standards

Leave a comment

The Triumvirate of Scientific Data

Posted by peanutbutter in bioinformatics, data standards, ontology, open data on October 30, 2008

In a recent Nature editorial entitled Standardizing data, several projects were highlighted that are forfeiting there chances of winning a Nobel prize (according to Quackenbush) and championing the blue collar science of data standardization.in the life-sciences.

I wanted to take the article a step further highlight three significant properties of scientific data that I believe to be fundamental in considering how to curate, standardize or simply represent scientific data; from primary data, to lab books, to publication. These significant properties of scientific data are the content, syntax, and semantics, or more simply put -What do we want to say? How do we say it? What does it all mean? These three significant properties of data are what I refer to as the Triumvirate of scientific data.

Content: What do we want to say?

Data Content is defined as the items, topics or information that is “contained in” or represented by a data object. What is, should or must be said. Generic data content standards exists, such as Dublin Core, as well as more focused or domain specific standards. Most aspects of the research life-cycle have a content standard. For example, when submitting a manuscript to a scientific publisher you are required to conform to a content standard for that Journal. For example, PlosOne calls their content standard Criteria for Publication and lists seven points to conform to.
The Minimum Information about [insert favourite technology] are efforts by the relevant communities to define content standards for their experiments. These do (should) not define how the content is represented (in a database or file format) rather they state what information is required to describe an experiment. Collecting and defining content standards for the life-sciences is the premise of the MIBBI project.

Syntax: How do we say it?

The content of data is independent of any structure, language implementation or semantics. For example when viewing a journal article on Biomed central you typically have the option to view or download the “Full Text” which is often represented in HTML or you have the option of viewing the PDF file or XML. Each representation has the same scientific content to a human but is structured and then rendered (or “presented”) to the user in three different syntax.
The majority of the structural of syntactic representation of scientific data is largely database centric. However, alternative methods can be identified such as Wikis (OpenWetWare, UsefulChem), Blogs (LaBLog), XML, (GelML), RDF (UniProt export) or described as a data model (FuGE) which can be realised in multiple syntax

Semantics: What do we mean?

The explicit meaning of data is very difficult to get right and is a difficult problem in the life-sciences. One word can have many meanings and one meaning can be described by many words. A good example of a failure to correctly determine the semantics of data is described in the paper by Zeeberg et al 2004. In the paper they describe the mis-interpretation of the semantics of gene names. This mis-interpretation of semantics resulted in an irreversible conversion to date-format by Excel and which percolated through to the curated LocusLink public repository.
Within the life-sciences the issue of semantics is being addressed via the use of Controlled vocabularies and ontologies.
According to the Neurocommons definition; A controlled vocabulary is an association between formal names (identifiers) and their definitions. A ontology is a controlled vocabulary augmented with logical constraints that describe their interrelationships. Not only do we need semantics for data, we need shared semantics, so that we are able to describe data consistently, within laboratories, across collaborations and transcending scientific domains. The OBO Foundry is one of the projects tasked with fostering the orthogonal development of ontologies – one term only appears in one ontology and is referenced by others – with the goal of shared semantics.

Summary

When considering how to curate, standardize or represent scientific data, either internally within laboratories, or externally for publication, the three significant properties of content, syntax and semantics should be considered carefully for the specific data. Consistent representation of data conforming to the Triumvirate of scientific data will provide a platform for the dissemination, interpretation, evaluation and advancement of scientific knowledge.

Acknowledgments

Thanks to Phil Lord for helpful discussions on the Triumvirate of data

Conflict of interest

I am involved in the MIBBI project, the development of GelML and a member of the OBO Foundry via the OBI project.

data curation, data representation, data standards, Dublin Core, HTML, MIBBI, OBO Foundry, ontology, RDF, representation, Resource Description Framework, RSS, science, semantic web, XML

4 Comments

MIAPE: Gel Informatics is now available for Public Comment

Posted by peanutbutter in bioinformatics, data standards, Journals Publishing, MIBBI, Proteomics on August 18, 2008

PSI logo

The MIAPE: Gel Informatics module formalised by the Proteomics Standards Initiative (PSI) now available for Public Comment on the PSI Web site. Typically alot of this information will be contained in the image analysis software, so we would especially encourage software vendors to review the document. The public
comment period enables the wider community to provide feedback on a proposed standard before it is formally accepted, and thus is an important step in the standardisation process.

This message is to encourage you to contribute to the standards development activity by commenting on the material that is available online. We invite both positive and negative comments. If negative comments are being made, these could be on the relevance, clarity, correctness, appropriateness, etc, of the proposal as a whole or of specific parts of the proposal.

If you do not feel well placed to comment on this document, but know someone who may be, please consider forwarding this request. There is no requirement that people commenting should have had any prior contact with the PSI.

If you have comments that you would like to make but would prefer not to make public, please email the PSI editor Norman Paton.

data standards, MIBBI, Proteomics, psi, reporting guidelines

Leave a comment

PEFF:A Common Sequence Database Format in Proteomics

Posted by peanutbutter in data standards, development on August 13, 2008

Image via Wikipedia

PEFF:A Common Sequence Database Format in Proteomics is now available for Public Comment on the PSI Web site (http://psidev.info/index.php?q=node/363). The public comment period enables the wider community to provide feedback on a proposed standard before it is formally accepted, and thus is an important step in the standardisation process.

This document presents a unified format for protein and nucleotide sequence databases to be used by sequence search engines and other associated tools (spectra library search tools, sequence alignment software, data repositories, etc). This format enables consistent extraction, display and processing of information such as protein/nucleotide sequence database entry identifier, description, taxonomy, etc. across software platforms. It also allows the representation of structural annotations such as post-translational modifications, mutations and other processing events. The proposed format has the form of a flat file that extends the formalism of the individual sequence entries as presented in a FASTA format and that includes a header of meta data to describe relevant information about the database(s) from which the sequence has been obtained (i.e., name, version, etc). The format is named PEFF (PSI Extended FASTA Format). Sequence database providers are encouraged to generate this format as part of their release policy or to provide appropriate converters that can be incorporated into processing tools.

This is an announcement to encourage you to contribute to the standards development activity by commenting on the material that is available online. We invite both positive and negative comments. If negative comments are being made, these could be on the relevance, clarity, correctness, appropriateness, etc, of the proposal as a whole or of specific parts of the proposal.

If you do not feel well placed to comment on this document, but know someone who may be, please consider alerting them towards this information. There is no requirement that people commenting should have had any prior contact with the PSI

community, data standards, fasta, Proteomics, psi, review, standards

Leave a comment

Double standards in Nature biotechnology

Posted by peanutbutter in bioinformatics, data standards, Journals Publishing, MIBBI on August 7, 2008

OK, So that is a relatively inflammatory and controversial headline, edging on the side of tabloid sensationalism. What is refers to is probably a situation that I may never find myself in again, which is in this months edition of Nature Biotechnology I am an author on two, biological standards related publications.

The first is a letter advertising the PSI’s MIAPE Guidelines for reporting the use of gel electrophoresis in proteomics. This letter is also accompanied by letters referring to the MIAPE guidelines for Mass Spectrometry, Mass Spectrometry Informatics and protein modification data.

The second is a paper on the Minimum Information about a Biomedical or Biological Investigations (MIBBI) registry entitled Promoting coherent minimum reporting guidelines for biological and biomedical investigations: the MIBBI project.

The following press release describes this paper in more detail.

More than 20 grass-roots standardisation groups, led by scientists at the European Bioinformatics Institute (EMBL-EBI) and the Centre for Ecology & Hydrology (CEH), have combined forces to form the “Minimum Information about a Biomedical or Biological Investigation” (MIBBI) initiative. Their aim is to harmonise standards for high-throughput biology, and their methodology is described in a Commentary article, published today in the journal Nature Biotechnology.

Data standards are increasingly vital to scientific progress, as groups from around the world look to share their data and mine it more effectively. But the proliferation of projects to build “Minimum Information” checklists that describe experimental procedures was beginning to create problems. “There was no way of even finding all the current checklist projects without days of googling,” says the EMBL-EBI’s Chris Taylor, who shares first authorship of the paper with Dawn Field (CEH) and Susanna-Assunta Sansone (EMBL-EBI). “As a result, much of the great work that’s going into developing community standards was being overlooked, and different communities were at risk of developing mutually incompatible standards. MIBBI will help to prevent them from reinventing the wheel.

The MIBBI Portal already offers a one-stop shop for researchers, funders, journals and reviewers searching for a comprehensive list of minimum information checklists. The next step will be to build the MIBBI Foundry, which will bring together diverse communities to rationalise and streamline standardisation efforts. “Communities working together through MIBBI will produce non-overlapping minimal information modules,” says CEH’s Dawn Field. “The idea is that each checklist will fit neatly into a jigsaw, with each community being able to take the pieces that are relevant to them.” Some, such as checklists describing the nature of a biological sample used for an experiment, will be relevant to many communities, whereas others, such as standards for describing a flow cytometry experiment, may be developed and used by a subset of communities.

“MIBBI represents the first new effort taking the Open Biomedical Ontologies (OBO) as its role model”, says Susanna-Assunta Sansone. “The MIBBI Portal operates in a manner analogous to OBO as an open information resource, while the MIBBI Foundry fosters collaborative development and integration of checklists into self-contained modules just like the OBO Foundry does for the ontologies”.

There is a growing understanding of the value of such minimal information standards among biologists and an increased willingness to work together across disciplinary boundaries. The benefits include making experimental data more reproducible and allowing more powerful analyses over diverse sets of data. New checklist communities are encouraged to register with MIBBI and consider joining the MIBBI Foundry.

Press release issued by the EMBL-European Bioinformatics Institute and the Centre for Ecology and Hydrology, UK.

Biology, Centre for Ecology and Hydrology, data standards, European Bioinformatics Institute, gel electrophoresis, MIAPE, MIBBI, Nature, Nature Biotechnology, OBO Foundry, Open Biomedical Ontologies, Organizations, publishing, reporting guidelines, UK, United Kingdom

Leave a comment

CARMEN – A Scalable Science cloud

Posted by peanutbutter in bioinformatics, CARMEN, cloud, conference, data standards, neuroinformatics, open science, video on June 25, 2008

Paul Watson presents a talk on CARMEN a the Google Seattle Conference on Scalability.

cloud computing, conference, Google, neuroinformatics, neuroscience, Paul Watson, scalability, Seattle

2 Comments

Proteomics Standards Initiative recommendations

Posted by peanutbutter in bioinformatics, data standards, development, Proteomics, Uncategorized on June 12, 2008

Several new standard artefacts have progressed through the public consultation process of the PSI. They are the MIAPE Column Chromatography document and the Mass Spectrometer Markup Language Specification (mzML).

Leave a comment

fgibson.com

Archive for category data standards

DITA – A framework for scientific publishing?

Attribution vs Citation: Do you know the difference?

The OBO foundry principles

HUPO PSI-PAR: standard format for protein affinity reagents

The Triumvirate of Scientific Data

MIAPE: Gel Informatics is now available for Public Comment

PEFF:A Common Sequence Database Format in Proteomics

Double standards in Nature biotechnology

Proteomics Standards Initiative recommendations

Archives

License

Top Posts

Archive for category data standards

Related articles

Archives

License

Top Posts