Improve quality of automatic metadata extraction

Improve the quality of the automatic metadata extraction; add automatic retrieval of metadata from arXiv, PubMed etc.

1,805 votes

    Mendeley (Admin, Mendeley) shared this idea
    started

    146 comments

      • alberto commented

        Please consider giving us an option to manually re-search for an article entry if it is wrong, and to use services other than Google Scholar. Papers 2 seems to handle this well and lets you select which services to search for metadata.

      • splash commented

        Some publishers, like ACS, provide the DOI for a paper in the link returned from Google Scholar. Why not automatically scan the returned links for a DOI and, if one is present, import it?

      • Reece commented

        If the article is in PubMed, the DOI and the PubMed ID should be stored with the record. Google Scholar search doesn't provide that (apparently).

      • J.H.D. commented

        Seems to me that metadata import/management is *the* (potentially) distinguishing feature on which Mendeley eventually succeeds or fails. While users will give you a break for a while, in steady state the "acceptable" bar here will be very high, and doing it badly will eventually be fatal.
        Closely related is the ease with which users can fix incorrect data. I just imported my library of several thousand documents -- hundreds of them should have been read correctly but were mangled, and hundreds more are talks or other items that I'd like to catalog but don't expect to be handled automatically. I'd grit my teeth and fix them if it were slick/fast, but it's slow and clumsy. So, give up, or wait and hope that the UI gets better?

      • Carl Anderson commented

        Mendeley's ability to garble metadata gleaned from JSTOR is particularly odd. Unless the paper has an easily grabbable DOI (which of course many older papers do not), Mendeley makes bizarrely erroneous guesses -- which is hard to understand, given that JSTOR provides pretty much all the relevant metadata in a fairly standardized format even with the PDFs. Yet Mendeley usually scoops up the "added" date rather than the "published" date, or interprets the "published" date as an issue number, or piles most of the metadata (including the stable URL, which ought to be quite machine-recognizable!) into the title, etc. This sort of thing ought to be reasonably easy to fix -- or so I would imagine.

      • Carin Basson commented

        I've found that Plant Physiology and PNAS metadata extraction is prone to problems. Both have their bibliographic information at the bottom of the page.

      • Carl Anderson commented

        I am surprised that Mendeley doesn't do a better job of using its own database of papers etc. to aid with identifying the correct metadata. It seems to me that a good behaviour would be, when a new item is added, that Mendeley tries to identify things like author and title in order to search existing entries already known within Mendeley, and presents the user with some options ("Is your new entry this paper, this paper, or this paper?"), of which the user can select the right one.

      • ebioman commented

        Just found a minor inconvenience related to metadata extraction:
        Page ranges are always "incomplete", e.g. 273-95 instead of 273-295.
        I assume this has some advantages, but on the other hand it leads to confusion with more complex literature.
        I would prefer it if we could choose this in the settings menu or somewhere similar.

      • Maurizio Paolillo commented

        Please allow metadata to be retrieved from NASA ADS (and maybe arXiv). They are much more accurate than Google Scholar for astrophysics publications.

      • Vicent commented

        Please, Mendeley staff, copy the way "Reference Manager v12" does it: you can use a search box directly in the desktop program interface, and it searches all the available databases. The information/metadata retrieval is said to be good.

      • Yoriko Y commented

        The metadata extraction should be made to work with languages other than English as well, especially those with non-Roman characters. Right now, Mendeley produces nonsensical metadata (if at all) for most of the documents I import that aren't in English.

      • Markus commented

        ACM Digital Library import also does not work 100% correctly. It seems to me that, when using the bookmarklet, publication titles are only imported up to the first colon (if there is one). For example, for http://dl.acm.org/citation.cfm?id=985692.985698 the bookmarklet imports the title "Caretta", even though it is "Caretta: a system for supporting face-to-face collaboration by integrating personal and shared spaces".

      • lukeb commented

        NASA ADS: the JavaScript button to import from NASA ADS often imports the comma between multiple authors. This comma ends up in the .bib file, and then BibTeX won't run properly until you manually delete the comma.

      • Yves commented

        For papers from IOP (Institute of Physics), the first page is usually a nonsensical list of useless information. Therefore, during automatic import, Mendeley usually cannot correctly extract the journal information. It would be great if Mendeley could SMARTLY skip the first page.
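
Two of the requests in this thread, scanning returned links for a DOI and storing full page ranges instead of abbreviated ones, are small enough to sketch. The Python below is only an illustration under stated assumptions, not how Mendeley works: the function names `find_doi` and `expand_page_range` are hypothetical, the DOI pattern is a common heuristic that will miss unusually formatted DOIs, and the URL in the usage example is made up.

```python
import re
from urllib.parse import unquote

# Heuristic DOI pattern (an assumption; real DOIs may contain other characters).
DOI_PATTERN = re.compile(r"10\.\d{4,9}/[-._;()/:A-Za-z0-9]+")
# A page range: digits, a hyphen or en dash, digits.
PAGE_RANGE = re.compile(r"^\s*(\d+)\s*[-\u2013]\s*(\d+)\s*$")


def find_doi(url):
    """Return the first DOI-like string in a URL, or None if there is none."""
    # Decode percent-escapes first, since some links encode the slash as %2F.
    match = DOI_PATTERN.search(unquote(url))
    if match is None:
        return None
    # Drop trailing punctuation that the pattern may have swallowed.
    return match.group(0).rstrip(".,;")


def expand_page_range(pages):
    """Expand an abbreviated range such as '273-95' to '273-295'."""
    match = PAGE_RANGE.match(pages)
    if match is None:
        # Not a plain numeric range (e.g. an article number); leave it alone.
        return pages
    first, last = match.groups()
    if len(last) < len(first):
        # Borrow the missing leading digits from the first page number.
        last = first[: len(first) - len(last)] + last
    return f"{first}-{last}"


if __name__ == "__main__":
    # Illustrative URL only; it does not point at a real article.
    print(find_doi("http://example.org/doi/abs/10.1000/xyz123?journalCode=abc"))
    print(expand_page_range("273-95"))
```

A reference manager could run something like `find_doi` over each search result link before falling back to title matching, and could apply `expand_page_range` (or its inverse) according to a user preference.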
