Freebase
Start typing to get some suggestions
  • Explore
  • Use
  • Build
  • Developers

        Bird Info

        Discussions on Bird Info

        Domains & Types » Bird Info » Discuss

        Start a New Discussion

        Discussion will be posted in:

        • Bird Info

        Think this discussion also relates to something else? Cross-post it by adding a new discussion area:

        • General Support,
        • Developer Support
        1.  

          AOU Type

          23 posts, latest post: spatialed, Dec 8, 2008
          Link to discussion
          1. xtine Metaweb Staff
            Oct 9, 2008
            xtine says:

            hi, spatialed.  i have noticed that birds some bird taxa in Freebase have a different topic name than the name used in the AOU lists. 

            jg recommended a separate AOU classification type, in which i could store data such as the AOU name and other data, such as its AOU abbreviations and any other data you think would be useful. we could also make the taxa ordered lists, so that we could force the list to be displayed in the way birders here would expect.

            i see you already have some types for AOU data that maybe could be turned into properties of one AOU type, which we could then use as a co-type all the birds in the area covered by the AOU.   since you are already familiar with the AOU data and are working on AOU types, i was wondering if you wanted to take a crack at this.

            1. spatialed Top Contributor Freebase Experts
              Oct 9, 2008
              spatialed says:
              That's funny you should write this right now. I noticed your question as I was about to start this exact type. I was wondering how to organize it and agree that an AOU type would be a good way to go. I'll take a crack at it. Let me know if you think anything should be added or changed.
            2. xtine Metaweb Staff
              Oct 9, 2008
              xtine says:
              great -- looking forward to seeing what you come up with. i will probably be resolving names for a little while yet, but when that is done, i can help populate that type.
            3. spatialed Top Contributor Freebase Experts
              Oct 9, 2008
              spatialed says:

              My suggestion:

              I don't see any way to do this except to add an AOU taxonomy CVT for each bird taxon’s English name. The CVT would link the AOU English name to its organism rank, Latin name, AOU edition, and AOU supplement. Both the English and Latin names can change over time so all this information needs to be stored together. Following the current Freebase format, the English name should be the topic described by the CVT. The higher and lower AOU taxa ranks probably need to be included in the AOU CVT too.

              If the AOU English name is the same as the ITIS English name then the topic will have both types. If the AOU name is different than ITIS then it will only have one or the other type. There is no way to properly "resolve" these different taxonomic systems. They are similar but different and both change over time. The changes within each system need to be maintained. Perhaps a “most similar ITIS taxon” property is needed, especially for those bird topics that do not have the same AOU and ITIS common names. Alternatively, we could use a more generic “equals”, “overlaps”, “contains”, and “contained by” properties for defining relationships among taxon topics associated with multiple taxonomies.

              I've brought this non-overlapping category issue up a few times now under the umbrella of "crosswalks" in various discussions, including those related to taxonomy. I still think they are a necessity. The world is categorized by different people in different ways. In this case, ITIS circa 2007 is not going to work as a one-size-fits-all taxonomic system. There are dozens of other taxonomies, including AOU. Freebase provides an amazing tool for describing relationships among taxonomies and other categorical classification systems. This ITIS/AOU conflict provides yet another example of how categories can be crosswalked in Freebase. Land cover classification schemes provide another.

              I'm working on an AOU taxonomy type but the current version of the UI is stymieing me again by not allowing me to delete properties or change the key. The draft is: http://www.freebase.com/type/schema/user/spatialed/bird_codes/aou  Ignore the properties below AOU supplement. The AOU code properties will need to be added separately, possibly within an AOU bucket type that contains the taxonomy CVT and a standard type with additional properties that do not need to be directly linked to the taxonomy.

            4. spatialed Top Contributor Freebase Experts
              Oct 9, 2008
              spatialed says:
              I updated the type and changed the key. Check out AOU taxonomy and let me know what you think. Unfortunately it's complex but I think that might be a necessity to keep all the dated information together. A simpler approach would be to update the names everytime a supplement to the AOU check-list came out. However, that would require someone committed to doing that. Also, the historical information would be lost.
            5. spatialed Top Contributor Freebase Experts
              Oct 9, 2008
              spatialed says:
              Sorry. This is the updated link. The previous one will probably be recycled as the bucket for the taxonomy type, codes, and anything else AOU related.
            6. xtine Metaweb Staff
              Oct 10, 2008
              xtine says:

              Ugh.... I would really rather go with a co-type, and not try to duplicate the hierarchical relationships.  I was envisioning a type that would only have AOU-specific properties.  I realize there may be some differences in the relationships, but I think they can be modelled without having to duplicate all the info.  Perhaps when I finish resolving the names, we can get some data on how hairy the conflicts are.  I don't think there will be so many differences that  mirroring the Organism Classification type will be warranted.

            7. spatialed Top Contributor Freebase Experts
              Oct 10, 2008
              spatialed says:

              So... after playing with the AOU taxonomy type for awhile I wasn't happy with it.

              I decided to let the existing Freebase English bird names drive the links between common names (linked to ITIS codes) and AOU English names. Each English bird name is type as "AOU English name" type. This type contains an "AOU Latin name" property/type that has disambiguation for taxon rank (e.g., species) and source (i.e., an edition of the AOU checklist and possibly supplement). For each AOU English name, one or more AOU Latin names can be added to track changes. For each Latin name, only one English name can be added. This means that there could be duplicate Latin names if one was changed in say the 5th edition of the checklist and then changed back in the 7th edition. Disambiguation is used to distinguish the similar Latin names topics by edition/supplement.

              So far this seems to work alright. The English names are simple to update. The Latin names are where the effort must be made to keep track of changes.

              Because of the way this is structured, Latin names are automatically reciprocated back to the checklist editions and supplements, which is great. Common names, however, must be reciprocated through additional properties. This adds some denormalization but I can't figure out how to get around that yet.

            8. spatialed Top Contributor Freebase Experts
              Oct 10, 2008
              spatialed says:

              Update: I have added properties for previous and revised English and Latin names. This is so far the best way I can figure out how to explictly make those connections.

              Also note that the AOU Latin names can only be linked to 1 edition of the checklist/supplement whereas common names can be linked to multiple. My thinking is that the Latin name will be used to identify the first case in which the Latin-English combo were used (only the American Flamingo data currently represent this relationship). The common name on the other hand can hold all instances of association with Latin names, editions, and supplements.

            9. spatialed Top Contributor Freebase Experts
              Oct 10, 2008
              spatialed says:
              Minor revision: Latin name could be used to describe a specific instance of a taxon description (e.g., the latest edition of the AOU checklist). This will allow you to resolve ITIS English names with AOU English names linked to a single instance of the Latin name. Additional Latin names can later be added to the English names to describe changes that were made in more recent supplements or to document previous Latin names in earlier editions and supplements.
            10. spatialed Top Contributor Freebase Experts
              Oct 10, 2008
              spatialed says:
              I've filled in a bit of data from the 49th supplement. The AOU Latin name and AOU English name types seem to be holding up well, including list imports. The supplement contains a lot of name and taxonomy changes that conflict with the ITIS data so it was a good test for how well those differences can be documented. I also added a couple text properties to document reasons for changes in nomenclature and taxonomy. This might become an enumerated list down the road.
            11. jeff Metaweb Staff
              Oct 13, 2008
              jeff says:

              I don't pretend to know much about the ins and outs of differing biological taxonomies, but I'm more than a bit concerned about having multiple topics for different classifications of the same organism, for example here, here, here, and here. These are all the same bird, as far as I can tell, and two of them are even the same name.  Users who want to link to this species -- Christine's checklist, say, or as the subject of a paper, will have to arbitrarily choose one, and anyone looking for a list of things about flamingoes will have to query against an unknown number of topics to get any kind of useful list.

            12. xtine Metaweb Staff
              Oct 13, 2008
              xtine says:

              I agree with Jeff.   I think that the  AOU Latin and common names should be text properties of an AOU Taxon type, which would be used only as a co-type to existing topics.  I'll take a crack at this sometime this week.  

            13. jg Metaweb Staff
              Oct 14, 2008
              jg says:

              +1, we should have one topic for one concept, and co-type as required for AOU or whatever other data we wish to capture for that concept.   

              What are the examples of AOU and ITIS taxonomy (not just names) conflicting?  It could be that we just have not added taxa aliases yet... 

            14. spatialed Top Contributor Freebase Experts
              Oct 14, 2008
              spatialed says:

              I stick with my comments above for why your simple approach, while very desirable, is not appropriate.

              I have tried to link to existing topics whenever possible but it is not always possible. I have also tried to provide documentation when links with existing topics are not appropriate (read above as well as the description and data for those topics). Jeff's reference to American Flamingo (the "here, here, here, and here") related topics is a great example. See this discussion on some of the historical merges and splits of this taxon/taxa with P. roseus in the 80's and 90's. How can this historical information be documented in the current schema? As an alias? That's not very useful. I'd like to see historical information summarized in Freebase rather than see Freebase turn into an encyclopedia limited to a single taxonomy at a single point in time with aliases that may or may not be appropriate depending on the date.

              What about a hypothetical but possible example of taxa that were split, where one side of the split keeps the Latin name and the other doesn't but both get new English names?

              What about a hypothetical but possible example of three taxonomic classification systems for plants, where one describes a taxon, the second splits it into two taxa, and the third splits it into two taxa but merges the second with a third that is not included in the first. Worse yet, each of these three classification systems may use the same English name for one of the taxa but different Latin names. Now what?

              I agree that we should only have one topic for a concept but there are few universal taxonomic concepts, especially if considered since Linneaus.To model dynamic and sometimes conflicting taxonomic systems and relationships, I did exactly what jg suggested: "have one topic for one concept, and co-type as required for AOU or whatever other data we wish to capture for that concept." My approach may not be the best way to model these relationships but I don't make up data or relationships to fit a simple mold. It seems that's what you want to sometimes by funneling conflicts into aliases.

            15. jeff Metaweb Staff
              Oct 14, 2008
              jeff says:

              I don't have any brilliant ideas regarding the splitting and joining of taxa, alas. I do however, think that mere name changes should not be recorded as new entities, any more than a company or person changing their name should result in a new topic.  Company uses a simple CVT to record name changes -- a text string with the name, with start and end dates.  Al created a "thing of many names" type does the same thing generically. I recommend something like these for name changes where no other changes to the classification occur.

            16. crism Metaweb Staff
              Oct 15, 2008
              crism says:

              I believe that with scientific data, including taxonomy, we should attempt to model the status quo of understanding of reality. Business really do change names, merge, etc.; species don’t, but our understanding of them changes. The main taxonomical topics can use aliases for a kind of history, but should not IMO attempt to rigorously model all the changes in our understanding of them. If someone (and yes, I am looking at you, Ed) wants to model taxonomical history, go for it, but the primary taxonomies should be unitary.

            17. spatialed Top Contributor Freebase Experts
              Oct 15, 2008
              spatialed says:

              Aye Aye. I'm interested in your interpretation of the status quo. There have been eight supplements to the 7th edition of the AOU check-list since it was published in 1998. Are these changes to be ignored or are the historical data to be erased? For example, there is no longer a Caribbean Flamingo so what are you going to do with that topic? Just add an alias? Currently you keep the defunct name for the topic and add the new name as an alias. This is an issue for 12 other taxa too: 

               

              "13 English names are changed (Phoenicopterus ruber becomes American Flamingo, two species of Colibri become Violetear rather than Violet-ear, Goethalsia bella becomes Pirre [rather than Rufous-cheeked] Hummingbird, Cnipodectes subbrunneus becomes Brownish Twistwing rather than Brownish Flycatcher, six species of Turdus become Thrush rather than Robin, Chlorothraupis carmioli becomes Carmiol’s [rather than Olive] Tanager, and Troupial becomes Venezuelan Troupial)"

               

               A little more difficult is that "two genera (Lysurus and Buarremon) are lost by merger (with Arremon);" Are the historcal genera to be deleted OR is the new genus to be ignored OR is the new genus to be added as an alias to both defunct genus topics OR are the historical genera to be merged into the new genus, added as aliases, and all documentation (which might have conflicts) merged into the new genus topic?

               

              Similarly, "18 scientific names are changed by transfer from one genus to another".  Will the new genus be added as an alias to the old one? Will the higher classification be changed to the new genus? If the higher classifcation is changed, will the old genus (or at least its links) be deleted? 

               

              What happens when someone wants to add data for a species that is no longer considered a species (e.g., name changed in the 1800s) and is therefore not included in Freebase? Are they supposed to link it to the closest existing taxon topic and list the old name as an alias even if the historical taxonomic concept was quite different than the existing one?


              I'm not trying to be difficult. I'm just really curious as to how there could be unitary primary taxonomies, who decides what those are, will they ever be updated, if so - when and by whom, what will happen to the historic data during updates, ...
            18. spatialed Top Contributor Freebase Experts
              Oct 20, 2008
              spatialed says:

              Thought I would share a comment on this discussion subject from a person who's job is to lead the development of large international conservation projects:

              " On taxonomy, you're right.  If the Freebase folks don't appreciate the complexities in taxonomy, they will not be prepared to accept many other complexities in this messy world.  Frankly, I'm surprised - sounds naive to me."

              Chris, I'm pretty happy with my model for AOU taxonimical history. The biggest issue I have with it is the creation of duplicate Latin names when there are only changes in English names. It just needs data now, which should be possible to scrape from the AOU editions but would need to be entered by hand from the supplements. I'll enter data from time to time as needed for projects. Hopefully others will too. 

            19. crism Metaweb Staff
              Oct 20, 2008
              crism says:

              This is your domain, and obviously you can make the model as complex or as simple as you want. If I were doing it, as I said, I would only attempt to capture the status quo, and just drop the history, using aliases if necessary. If you want to model the history, then go ahead. We do appreciate the complexity—and that’s why I, at least, would not attempt to model it right now. Let your friend call it naïve when he’s tried it himself. (-:

            20. jeff Metaweb Staff
              Oct 20, 2008
              jeff says:

              I have no issue with the attempt to model the complexity of a taxonomy -- I'm all for it.  And maybe what Freebase can supply for now is a template to which more complex structures can be appended.

              But I share your concern with having to create multiple Latin names when only the English name has changed (and presumably vice versa).  Actually, I have a concern with creating new topics based solely on name-changes in general, but this one is particularly odd.  In cases where there is no change in what is being described, merely the way of describing it, why not just use a CVT linked from an "AOU classification" type, that would contain these properties: Latin name, English name, AOU checklist edition, Supplement, and Reason for change (with the names being text strings). You could then have properties to describe splits and mergers specifically. The current AOU model doesn't seem to allow for this useful distinction, other than as text string note. Right now, for example, you can't necessarily tell whether the "American flamingo" and "Carribean flamingo" are different names for the same organism, or two different ways of splitting up the organisms.

              That's my $.02, anyway.

            21. spatialed Top Contributor Freebase Experts
              Oct 21, 2008
              spatialed says:

              Thanks for the ideas Jeff. I'll rethink the model and try a few other things. I did try a simple CVT approach first but I couldn't figure out a better way to permit tracking of changes in classification ranks and nomenclature. My current model does explcitly state that American Flamingo is the revised name of Caribbean Flamingo. The reason for revision is a text box but I see that as becoming an enumerated list down the road. Properties that link topics over the evolution of the taxonomy include previous name and revised name for both English and Latin names as well as higher taxon and lower taxon for Latin names. Nomenclature changes without taxonomic changes (e.g., Caribbean to American Flamingo) are probably in the minority so I would rather keep the model more inclusive of all changes and only consider this kind as a special case that can be captured by the model.

            22. spatialed Top Contributor Freebase Experts
              Dec 8, 2008
              spatialed says:

              Some food for thought: http://www.vegbank.org/vegdocs/design/planttaxaoverview.html

          Discussion is posted in:

          • close Bird Info

          Think this discussion also relates to something else? Cross-post it by adding a new discussion area:

          • General Support,
          • Developer Support

        Search Discussions

        ©2009  Metaweb
        • Page History
        • RDF
        • Feedback
        • Attribution Policy
        • Terms of Service
        • About Us
        • Jobs
        • Freebase Blog
        Freebase contains information on:
        • Arts & Entertainment
        • Products & Services
        • Science & Technology
        • Society
        • Special Interests
        • Sports
        • Time & Space
        Dev Tools
        Refresh cache | Query Editor | Normal view | Explore | Explore2 | Admin view | View transaction log | Suggest transaction log | Client transaction log | hide (F8) | debug-level
        TID(s):
        Controller: 0.279s
        Template: 0.133s
        Cost: br=23.0, cc=0.375, ch=0.0, cm=0.0, cm+h=0.0, cr=0.0, cs=8.0, cw=4.0, dr=5409.0, dt=0.422, dw=0.0, gqr=0.0, in=3574.0, ir=40.0, iw=0.0, mcs=0.0, mcu=0.067, mr=4.0, nivcsw=78.0, nreqs=9.0, nvcsw=101.0, pf=0.0, pr=0.0, stime=0.003, te=0.042, tf=0.055, tg=0.047, tm=0.13, tr=0.039, ts=0.0, tu=0.032, utime=0.371, va=13874.0