-
Notifications
You must be signed in to change notification settings - Fork 0
Goals and progress
Page for setting goals and tracking project progress, updated weekly
- Test and merge support for multiple external specifiers.
- Complete and submit fix on Phyloref statuses.
- Finish Curation Tool UI updates.
- Incorporate changes from first major revision of Phyloref Ontology into phyx.js.
- Submit stickers for printing.
- Finish providing additional information during phyloref testing and incorporate previously curated phyloreferences.
- Delete phyx2owl from Clade Ontology.
- Add previously curated phyloreferences.
- Finish and submit script to synthesize into a single combined ontology.
- Respond to comments on behavior-based tests.
- Research sticker companies and choose company to use.
- Explored the practicality of using Vue CLI to break up the Curation Tool into components.
- Worked on providing additional information during phyloref testing as an intermediate step to incorporating previously curated phyloreferences.
- Submit travel grant application for US2TS 2019
- Finish and submit behavior-based tests for review.
- Go through first major revision of Phyloref Ontology
- Incorporated improvements to phyx.js and JSON-LD generation.
- Tested changes in Phyloref Ontology generation.
- Delete phyx2owl from Clade Ontology.
- Add previously curated phyloreferences.
- Finish and submit script to synthesize into a single combined ontology.
- Finish Curation Tool UI updates.
- Wrote draft of blog post on the Durham trip.
- Refactored JSON-LD generation in phyx.js.
- Submitted paperwork for Durham trip.
- Replaced Python test suite with a Node.js test suite.
- Improved JPhyloRef to read content from STDIN and to read JSON-LD directly.
- Finish adding the PHYX-to-JSON-LD converter to the Clade Ontology test suite (clade-ontology#47).
- Finish pull requests for jphyloref#36.
- Follow up on the sticker and T-shirt designs.
- Finish basic test suite for JPhyloRef as well as support for JSON-LD in TestCommand.
- Started working on the JavaScript PHYX-to-JSON-LD converter to the Clade Ontology test suite, including required changes to the Curation Tool.
- Continue working on JPhyloRef and Clade Ontology with a goal of producing lists of passing and failing phyloreferences and producing the Clade Ontology as an OWL ontology
- Decide whether to retain the Python library or to replace it with the Javascript library for converting PHYX to JSON-LD.
- Extend JPhyloRef to support running tests directly in JSON-LD rather than RDF/XML.
- Extend JPhyloRef/Python library/Javascript library to support validating and categorizing phyloreferences.
- Start writing scripts to combine JSON-LD files into a single ontology.
- Polish JPhyloRef PRs and submit for review: phyloref/jphyloref#30, phyloref/jphyloref#24
- Get PDFs for our sticker and T-shirt design from Kim Schoonover (who designed our logo)
- Make requested fixes to phyloref/curation-tool#109
- Made a list of information in Regnum that should be included in the Clade Ontology: phyloref/clade-ontology#46
- Finish cleaning up JPhyloRef: PR #24, PR #30, PR #32
- Continue working on JPhyloRef and Clade Ontology with a goal of producing lists of passing and failing phyloreferences and producing the Clade Ontology as an OWL ontology
- Get PDFs for our sticker and T-shirt design from Kim Schoonover (who designed our logo)
- Submit PRs for review once previous PRs have been reviewed: PR #109
- Complete immigration paperwork to allow me to work for another two years
- Work on the Clade Ontology: improve JPhyloRef and the test suites, produce lists of passing and failing phyloreferences and start producing the Clade Ontology as an OWL ontology
- Get PDFs for our T-shirt design from Kim Schoonover (who designed our logo)
- Polished several Curation Tool pull requests: PR #105, PR #107, PR #108, PR #110, PR #111
- Started cleaning up JPhyloRef: PR #26, PR #29, PR #33,
- Complete immigration paperwork to allow me to work for another two years
- Figure out why we need a workaround for datatype/annotation terms
- See if there's any value in curating phyloreferences by hand
- Start working on curator-focused improvements for the Curation Tool
- Test and document curated phyloreferences
- Improve new term requests to the Phyloref Ontology
- Complete immigration paperwork to allow me to work for another two years
- Finalize reasoning in the Curation Tool
- Test and document curated phyloreferences
- Improve new term requests to the Phyloref Ontology
- Complete immigration paperwork to allow me to work for another two years
- Complete SPNHC/TDWG meeting blog post outline
- Complete pull requests for visual improvements made to the Curation Tool in New Zealand: phyloref/curation-tool#100 (still in review), phyloref/curation-tool#102, phyloref/curation-tool#103
- Improve new term requests to the Phyloref Ontology
- Finalize reasoning in the Curation Tool: phyloref/curation-tool#104, phyloref/curation-tool#105, phyloref/curation-tool#106, phyloref/jphyloref#25, phyloref/jphyloref#26
- Test and document curated phyloreferences
- Improve new term requests to the Phyloref Ontology
- Figure out why we need a workaround for datatype/annotation terms
- Finalize reasoning in the Curation Tool
- Test and document curated phyloreferences
- See if there's any value in curating phyloreferences by hand
- Start working on curator-focused improvements for the Curation Tool
- Finish Durham trip paperwork
- Continue working on TDWG presentation and send out a draft of the Powerpoint before Thursday.
- Improve new term requests to the Phyloref Ontology
- Organize embargoed phyloreferences and submit for review. Will require the faster speed from converting the Clade Ontology to use the FaCT++ reasoner for improved speed (phyloref/clade-ontology#43)
- Get to New Zealand in one piece.
- Polish and present the next draft of the TDWG presentation
- Improved the Curation Tool interface for TDWG presentation
- Looked through some ontologies and thought through proposed changes to the Phyloref Ontology.
- Fixed some bugs in JPhyloRef (phyloref/jphyloref#22, phyloref/jphyloref#23) and submitted a PR for review (phyloref/jphyloref#17).
- Improve new term requests to the Phyloref Ontology
- Organize embargoed phyloreferences and submit for review. Will require the faster speed from converting the Clade Ontology to use the FaCT++ reasoner for improved speed (phyloref/clade-ontology#43)
- Completed and presented a first draft of the TDWG presentation
- Change the IRI to reflect the new Phyloref Ontology (as per phyloref/phyloref-ontology#21) in phyloref/clade-ontology#41.
- Added the FaCT++ reasoner as an optional reasoner to JPhyloRef (https://github.com/phyloref/jphyloref/pull/19)
- Starting moving the Clade Ontology to the FaCT++ reasoner for improved speed (phyloref/clade-ontology#43)
- Improve new term requests to the Phyloref Ontology
- Organize embargoed phyloreferences and submit for review -- pretty much ready to go, except for all the failures because of missing specifiers. Would be nice to fix that before submitted this PR.
- Work on TDWG presentation.
- Come up with an OWL restriction representation of a phyloreference that we can show in our TDWG presentation (as per phyloref/curation-tool#64) -- in progress
- Change the IRI to reflect the new Phyloref Ontology (as per phyloref/phyloref-ontology#21) in phyloref/clade-ontology#41.
- Improve new term requests to the Phyloref Ontology
- Organize embargoed phyloreferences and submit for review -- pretty much ready to go, except for all the failures because of missing specifiers. Would be nice to fix that before submitted this PR.
- Made improvements to phyloref/curation-tool#77 and resubmitted for review.
- Added support for FaCT++ and submitted for review as phyloref/jphyloref#20.
- Fixed a minor issue in JPhyloRef in phyloref/jphyloref#19.
- Change IRIs across all tools to reflect the new Phyloref Ontology (as per phyloref/phyloref-ontology#21) and submit pull requests -- in progress in phyloref/jphyloref#16, phyloref/curation-tool#82 and phyloref/clade-ontology#41.
- Polish and review Curation Tool pull requests (phyloref/curation-tool#73, phyloref/curation-tool#77) and submit for review
- Started updating the user interface to fix some bugs (phyloref/curation-tool#86)
- Improve new term requests to the Phyloref Ontology
- Organize embargoed phyloreferences and submit for review -- pretty much ready to go, except for all the failures because of missing specifiers. Would be nice to fix that before submitted this PR.
- Organize embargoed phyloreferences and submit for review.
- Improve new term requests to the Phyloref Ontology
- Polish and review Curation Tool pull requests (phyloref/curation-tool#73, phyloref/curation-tool#77) and submit for review.
- Finished and merged PR for JPhyloRef webserver (phyloref/jphyloref#12, phyloref/jphyloref#13).
- Helped Hilmar update Phyloref Ontology IRIs so they are resolvable (phyloref/phyloref-ontology#21).
- Summarized information on curated phyloreferences for our meeting.
- Planned updated user interface for selected clades (phyloref/curation-tool#12).
- Continue testing of the git-crypted directory for embargoed phyloreferences and submit for review.
- Improve new term requests to the Phyloref Ontology
- Review Curation Tool JSON-LD export (phyloref/curation-tool#73) and submit for review.
- Update JPhyloRef to run an HTTP server, accepting JSON-LD files and producing reasoning results in JSON (phyloref/jphyloref#12)
- Modify the Curation Tool to be able to send JSON-LD requests to JPhyloRef and then to update the user interface to report on where phyloreferences resolved. (phyloref/curation-tool#77)
- Update pull request to update the directory structure of the Clade Ontology and README files and resubmit for review.
- Address Hilmar's comments on separating Curation Tool components into classes
- Follow up on outstanding pull requests: phyloref/curation-tool#63, phyloref/clade-ontology#35.
- Improve new term requests to the Phyloref Ontology
- Review Curation Tool JSON-LD export (phyloref/curation-tool#73) and submit for review.
- Update JPhyloRef to run an HTTP server, accepting JSON-LD files and producing testing results in JSON.
- Modify the Curation Tool to be able to send JSON-LD requests to JPhyloRef and then to update the user interface to report on where phyloreferences resolved and whether they resolved as expected.
- Test and incorporate Rebecca and Guanyang's PHYX submissions into the Clade Ontology repository using git-crypt (phyloref/clade-ontology#38)
- Updated PR for replacing the Phyloref Ontology so it can be reasoned over by JFact 5 and resubmitted for review.
- Created a PR for translating the Phyloref Ontology into OWL Functional Syntax and submitted for review.
- Test and incorporate Rebecca and Guanyang's PHYX submissions into a private folder (phyloref/clade-ontology#38)
- Start work on incorporating reasoning into the Curation Tool (phyloref/curation-tool#3) -- resulted in pull requests phyloref/curation-tool#73, phyloref/jphyloref#10, phyloref/phyloref-ontology#19
- Improve new term requests to the Phyloref Ontology
- Improve new term requests to the Phyloref Ontology
- Refactor some Curation Tool code into classes in order to eliminate some complex methods and to squash some final ESLint issues -- filed as phyloref/curation-tool#63.
- Add a test suite to the Curation Tool -- filed as phyloref/curation-tool#63.
- Merged pull requests (phyloref/curation-tool#66, phyloref/clade-ontology#32, phyloref/jphyloref#6, phyloref/jphyloref#9)
- Start work on incorporating reasoning into the Curation Tool
- Refactor some Curation Tool code into classes in order to eliminate some complex methods and to squash some final ESLint issues
- Add a test suite to the Curation Tool
- Improve new term requests to the Phyloref Ontology
- Update README for Clade Ontology as phyloref/clade-ontology#35.
- Updated JPhyloref so that it can test pre-reasoned ontologies -- included in phyloref/jphyloref#9
- Finished Wojciechowski, 2003 and submitted for review as phyloref/clade-ontology#32
- Improve the process of indicating whether the curator expects a phyloreference to resolve correctly, thus ensuring that PHYX files can be included directly into the Clade Ontology, and release as Curation Tool v0.1.1
- Refactor some Curation Tool code into classes in order to eliminate some complex methods and to squash some final ESLint issues
- Add a test suite to the Curation Tool
- Continue moving terms from the Clade Ontology to the Phyloref Ontology by filing issues
- Write a curation protocol/documentation.
- Curate two papers.
- Compile use cases.
- Improve the process of indicating whether the curator expects a phyloreference to resolve correctly, thus ensuring that PHYX files can be included directly into the Clade Ontology, and release as Curation Tool v0.1.1
- Refactor some Curation Tool code into classes in order to eliminate some complex methods and to squash some final ESLint issues
- Add a test suite to the Curation Tool
- Continue moving terms from the Clade Ontology to the Phyloref Ontology by filing issues
- Helped write advertisement for next postdoc
- Cleaned up an old pull request on
phylo2owland resubmitted for review - Migrated all Todoist tasks to other platforms
- Work towards finishing Curation Tool v0.1.1, including:
- Refactor some Curation Tool code into classes in order to eliminate some complex methods and to squash some final ESLint issues
- Add a test suite to the Curation Tool
- Fix curation-tool#55 reported by Guanyang
- Improve curation UI as suggested by Guanyang
- Improve the process of indicating whether the curator expects a phyloreference to resolve correctly
- Continue discussions of specifiers, including
Specifierclass,has_internal_specifier,has_external_specifier,cdao:represents_TU - Plan and book travel to NZ for TDWG 2018
- Refactor some Curation Tool code into classes in order to eliminate some complex methods and to squash some final ESLint issues
- Add a test suite to the Curation Tool and fix some Curation Tool bugs reported by Guanyang
- Improve the process of indicating whether the curator expects a phyloreference to resolve correctly
- Continue filing issues to move terms into the formal ontology
- Finished and merged ESLint-related cleanup
- Finished Curation Tool v0.1 and submitted for review
- Continued discussions of specifiers, including
Specifierclass,has_internal_specifier,has_external_specifier,cdao:represents_TU
Updates
- Curated Huisman et al. (2003, which includes:
- Transcribing graphical tree to newick format, using a newly invented manual method.
- Entering data into curation tool and generating a JSON file.
Goals
- Curate additional definitions/papers
- Compile previously proposed use cases
- Finalize TDWG talks schedule
- Complete standardizing and refactor Curation Tool code formatting
- This includes fixing bugs discovered and reported by Guanyang.
- Fix pull request to fix jphyloref phyloreference testing code when using
expected_phyloreference_namedand submit for review - Add a mechanism in the Curation Tool to annotate phyloreferences that are not expected to resolve, likely by implementing phyloreference states
- Update a Clade Ontology pull request in which some phyloreferences cannot yet be implemented.
- Continue moving terms into the formal ontology by filing issues.
- (Next week) Start working on reasoning in the Curation Tool
Updates
- Curated definitions in Brochu (2003)
- Presented on curation experiment and documented issues concerning node labeling
- Drafted AB meeting blog and revised it twice.
- Curate definitions in Regnum and document time expenses.
- Compile list of use cases developed or proposed in a hackmd document.
- Finalize TDWG talk schedule.
- Follow up on code reviews on Curation Tool v0.1, which include:
- Fix pull request to fix jphyloref phyloreference testing code when using
expected_phyloreference_namedand submit for review - Add a mechanism in the Curation Tool to annotate phyloreferences that are not expected to resolve, likely by implementing phyloreference states
- Update a Clade Ontology pull request in which some phyloreferences cannot yet be implemented.
- Start moving terms into the formal ontology by filing issues.
- Standardize and refactor Curation Tool code formatting
- (Next week) Start working on reasoning in the Curation Tool
- Write blog for AB meeting.
- Curate Brochu 2003.
- Curate another paper with a larger phylogeny.
- Discuss use cases with Gaurav.
- Follow up on code reviews on Curation Tool v0.1, which include:
- Finish pull requests related to Curation Tool v0.1 and submit for review:
- Start moving terms into the formal ontology by filing issues.
- Rename repository ‘curation-workflow’ to a better name, possibly ‘ontology-of-phyloreferences’ (phyloref/curation-workflow#19)
- (After code reviews) Standardize and refactor Curation Tool code formatting
- (Next week) Start working on reasoning in the Curation Tool
Guanyang was in China.
- Continue curating two papers with phyloreferences: Bryant 1996 and Brochu 2003.
- Brochu 2003 has been curated. Bryant 1996 is being held up because we don't support matching of monomial names, which is scheduled to be built with Curation Tool v0.1.
- Reorganize multiple small changes made while testing into separate pull requests for review.
- Start moving terms into the formal ontology by filing issues.
- Rename repository ‘curation-workflow’ to a better name, possibly ‘ontology-of-phyloreferences’ (phyloref/curation-workflow#19)
- Close blog pull request and publish it.
- Write up several use cases and share with team.
- Review TDWG2018 abstracts.
- Start moving terms into the formal ontology by filing issues.
- Rename repository ‘curation-workflow’ to a better name, possibly ‘ontology-of-phyloreferences’ (phyloref/curation-workflow#19)
- Finish developing Curation Tool 0.1, which requires a few extra curator fields, support for renaming nodes on phylogenies, and further testing (phyloref/curation-workflow/projects/1)
- Worked on pull request for Curation Tool specifier matching, which has now been merged.
- Create a pull request for renaming nodes on a phylogeny and submitted it for review.
- Started work on curating two papers with phyloreferences: Bryant 1996 and Brochu 2003.
- This lead to multiple small changes, which I plan to cherry-pick out of its branch into separate pull requests once Curation Tool v0.1 is usable for curation.
- Did some research into Javascript code style and what a test suite for the Curation Tool might look like.
- Started work on renaming all Javascript variables from snake_case to camelCase.
- Apply for travel funding through the FLMNH.
- Go through meeting notes for the third f2f meeting, extract all tasks, and add them to Github as issues.
- Add a section on the Curation Tool to the third f2f meeting blog post.
- Reorganize Github Projects on the basis of the updated software development plan for the next six months. In particular:
- Rename the Curation Workflow to Ontology of Phyloreferences (phyloref/curation-workflow#19), and reorganize its Github projects
- The focus of development should be Curation Tool 0.1, which is a complete enough Curation Tool to create PHYX files that can be incorporated into the ontology of phyloreferences.
- This will be followed by the Curation Tool 0.2, which will incorporate reasoning directly into the application.
- We can take our time with Curation Tool 1.0, which will incorporate user interface feedback from users and create a more polished curation experience.
- Cleaned up Curation Tool specifier matching pull request and submitted to Hilmar for review (phyloref/curation-tool#10)
- Start moving terms into the formal ontology by filing issues.
- (Probably next week) Fully curate five papers into the Curation Workflow, with no outstanding test failures
Updates (Mar 12-16):
- Finished TDWG talk abstract. Link here.
- Discussed with Gaurav about upcoming f2f meeting and designated tasks.
- Manually mapped some of Wilcox & Hillis's Rana clade definitions to Che et al (2007). Contacted John Wiens and obtained a data file of his phylogeny in Wiens et al. (2009).
Goals/tasks
- Prepare for f2f meeting.
- F2f meeting.
Updates:
- TDWG talk abstract 70% done. Talk will focus on OTT-VTO taxonomy mapping and data integration/synthesis with Phenoscape, GBIF and Traitbank.
- Contacted ~30 researchers to solicit talks. Received 10 applications and 3 abstract submissions.
- Discussed with Scott Chamberlain over email about a tool to perform "phylogeny-aware" data queries. He is considering writing an R package to do this and giving a TDWG talk.
Goal/tasks for this week:
- Finish TDWG abstract.
- Prepare for upcoming f2f meeting: (1) aligning perspectives, (2) phyloreferencing: theoretical and practical considerations, (3) pathway towards developing potential use cases, and (4) tractable research problems. More specifically, gather a list of research questions as a base from which we will develop use cases.
- Review submitted TDWG abstracts.
- Apply clade definitions of Wilcox & Hillis (2005) to Che et al (2007) and Wiens et al. (2009), two newer and larger phylogenies.
- Review a PeerJ manuscript on Asillidae phylogeny.
- Finish draft(s) of TDWG conference presentation(s)
- Create a working prototype of the curation tool
- Follow up on review of pull request for basic application phyloref/curation-tool#5
- Finish pull request for matching specifiers phyloref/curation-tool#10
- Improve Curation Tool to perform reasoning on phyloreferences
- Write a reasoner that the Curation Tool can use phyloref/curation-tool#3
- Fully curate five papers into the Curation Workflow, with no outstanding test failures
- Follow up on review of pull request phyloref/curation-workflow#17
- Identify and import three other papers into the Curation Workflow using the Curation Tool.
- Extensive discussions with Guanyang about use cases for Phyloreferencing, including discussion of previously raised use-cases
- Continue recruiting speakers for the TDWG symposium.
- Finish two TDWG abstracts, one on OTT-VTO mapping for own symposium, and another on data and knowledge for a different symposium (probably as contributed).
- Conference call with Hilmar and Emily (already happened on Mon).
- Curation Tool v0.1
- Follow up on pull request phyloref/curation-tool#5
- Add specifier matching to the Curation Tool phyloref/curation-tool#6
- Fully curate five papers into the Curation Workflow, with no outstanding test failures
- Follow up on pull request phyloref/curation-workflow#17
- Identify and import three other papers into the Curation Workflow using the Curation Tool.
- Start draft(s) of TDWG conference presentation(s)
- Write initial agenda for f2f meeting next week
Updates
- Met with TDWG symposium co-organizers online, finalized a draft to call for abstracts and distributed it via Evoldir, Taxacom and Twitter.
- Glimpsed Hillis & Wilcox (2005) and Hillis (2007).
- Met with David Hillis and talked about phyloreferencing and other aspects of phylogenetic nomenclature during formal meeting and socials.
- Discussed with other researchers about the prospect of using a rank-free taxonomy to organize natural history collections. I was contemplating a little bit applying phyloreferencing to collections. See Facebook and Twitter threads.
Goals/tasks
- Send emails to recruit symposium speakers. List of names here.
- Distribute Call for Abstracts more broadly, including various society FB groups and listservs.
- Decide on a topic for presentation at TDWG2018. See last week's log for possibilities.
- Write a small report on the meeting with D. Hillis and share it internally.
- Prepare a write-up on the issues of integrating data in face of unstable clade content.
- Finish pull request of Curation Tool v0.1 (phyloref/curation-tool#5)
- Fix specimen matching in line with Hilmar's review (phyloref/curation-workflow#17)
- Fully curate five papers into the Curation Workflow, with no outstanding test failures
- Identify and import three other papers into the Curation Workflow using the Curation Tool.
- (probably next week) Create a separate pull request for specifier matching in the Curation Tool.
- (probably next week) Propagate some changes from the Curation Tool back into the Curation Workflow:
- Remove '@id' from study (phyloref/curation-tool#2, phyloref/curation-workflow#15)
- Resolve confusion between scientific name and dwc:scientificName (phyloref/curation-workflow#16)
- Standardize cases of terms in the testcase document (phyloref/curation-workflow#12)
- Rename the term for a JSON file consisting of phylogenies and phyloreference as a "curated study" (phyloref/curation-workflow#18)
- (next week) Improve Curation Tool to perform reasoning on phyloreferences
- Write a reasoner that the Curation Tool can use (phyloref/curation-tool#3).
I spent most of my efforts on digging into the case of Basidiomycota and organizing the TDWG2018 symposium. After some discussions within our project team and with David Hibbet, we may conclude that the same definition may resolve to clades with different compositions on different phylogenies. We assert that the clade definition remains unchanged. What might be the practical implications of this theoretical stance? What questions potential stakeholders (biologists, taxonomists and biodiversity researchers) of our phyloreferencing project might ask when presented with such a viewpoint? More generally, how does phyloreferencing enable better systematics and biodiversity research, while the theoretical underpinning diverges quite significantly from "traditional" thinking? To explore those issues, it will be worthwhile to investigate the following questions. (1) How does changing clade composition affect data integration of organismal traits? (2) How should a clade name be used on different phylogenies? (3) Node and branch-based definitions of the same name may resolve to different clades. What are the philosophical and empirical bases for composing a definition and choosing between the two methods? (4) Can phyloreferencing facilitate comparison of clade compositions? I would welcome any comments on those questions, especially regarding how they may drive the development of actual phyloreferencing use cases.
For this week, I am going to focus primarily on TDWG organizing as well as preparing for a talk abstract(s) for that meeting.
- [x ] TDWG2018 organizing - conference call, distribute funding application form and invite speakers.
- TDWG2018 abstract preparation. Some possible presentation topics:
- OTT/VTO/GBIF taxonomy matching and phylogeny-driven data integration.
- Integrate phylogenetic information into biodiversity databases.
- Rank-free taxonomy in biodiversity databases.
- Closing the gap between data and knowledge - how do we discover new natural history knowledge in biodiversity data?
- [x ] With David Hillis visiting our lab on Friday, I would like to take a close look at Rana.
- Create a working prototype of the Curation Tool
- Add support for editing specifiers to the Curation Tool
- Fully curate five papers into the Curation Workflow, with no outstanding test failures
- Identify and import three other papers into the Curation Workflow using the Curation Tool.
- Improve Curation Tool to perform reasoning on phyloreferences
- Write a reasoner than the Curation Tool can use (phyloref/curation-tool#3).
I queried clade definitions stored Phyloregnum against the Open Tree of Life (OTL), focusing on node-based definitions (based on internal specifiers). The results of the queries are recorded here. While most queries returned a clade (a node that represents the Most Recent Common Ancestor, or MRCA of the specifiers used in the query) in OTL with the same name as the clade definition, some returned a clade without a name, or a different name. An interesting case is Basidiomycota. A query using the four specifiers listed in Phyloregnum returned a clade (or MRCA) labeled as "h2007-1", which includes Basidiomycota, Ascomycota and Entorrhizomycota. After a bit of readings, I found out about what is going on here. Entorrhiza casparyana was formerly part of Basidiomycota, but it got classified into a recently proposed and named phylum "Entorrhizomycota" (Bauer et al., 2015), which is placed as the sister to Dikarya (Basidiomycota and Ascomycota). OTL reflects this taxonomic treatment and that phylogenetic relationship. A query using the four specifiers of the former, more broadly defined Basidiomycota should concern only two lineages in OTL, namely, the redefined, narrow Basidiomycota and the new phylum Entorrhizomycota, but not Ascomycota. Basidiomycota s.s. (sensu strico) and Entorrhizomycota formed a paraphyletic relationship with respect to Ascomycota, so the MRCA would have to be the node that has all three lineages as children, which is h2007-1 (no idea where that comes from). I can draw some simple trees to illustrate this, if you feel that may help with your understanding.
This Basidiomycota case demonstrates how a clade definition could identify different clades when the underlying phylogeny changes. It is true that the definition did not change, but it appears to be clear that the same definition references two different clades on two different phylogenies, with the same specifiers. This seems contradicting to the goal of phyloreferencing, i.e, creating stable, precise clade definitions. Can somebody offer a different view on how to interpret this, or why this might not be a problem, or how phyloreferencing will address this?
Another challenge is the feasibility to compare the OTL clade composition with that of the original or intended clade. We cannot assume that a clade with the same name would definitely mean the "same" clade. If that assumption holds, then we would not have the messy issue of the same name referring to different things, which is quite commonplace. The original publication of the clade definition usually included just a small phylogeny with some "higher-level" taxa labeled, rather than all of their descendants listed. This it leaves lots of room for interpretation as to how the lineages on the original phylogeny compares to those in OTL. Also, that comparison is difficult to make manually as OTL contains hundreds and thousands of tips for any given clades (at the "higher" level). One idea of Phyloreferencing is to give computable definition (or concept) to a clade, and I have been pondering how we can use phyloreferences to compare clades computationally, but I think I can use a little help there. For the aforementioned issues, can we discuss them as our Wed conference call agendas?
Goals for the week
For this week, I would like to continue doing the Phyloregnum queries. Besides documenting the search results, a specific goal for that is explore if and how phyloreferencing can be used to compare clade compositions. This is something biologists will find highly desirable.
- Continue Phyloregnum definition queries agains OTL.
- Explore if and how phyloreferencing can be used to compare clade compositions.
- (Unfinished goal from previous week) Understand the nature of phylogenetic clade definitions (e.g., intensional or ostensive). Read Ghiselin, 1984, Ghiselin, 1995, and Rieppel, 2006. Skim Stanford & Kitcher
- Create a working prototype of the Curation Tool
- Add support for editing specifiers to the Curation Tool
- Fully curate five papers into the Curation Workflow, with no outstanding test failures
- Work on pull request tracking unmatched specifiers in the Curation Workflow phyloref/curation-workflow#9
- Add support for specimen matching to the Curation Workflow: phyloref/curation-workflow#8
During the the past week, I started looking at clade definitions stored in PhyloRegnum and explored the possibility of using those to retrieve clades from the Open Tree's synthetic tree. I played with Open Tree's API to learn how to find the MRCA (most recent common ancestor) of a set of OTT ids, or a node in the synthetic tree. It is possible to locate a clade using a set of internal specifier, but I will need to investigate further the relationship of the clade recovered from Open Tree and the one defined in the original publication. Just because a node in the Open Tree bears the same name as the original clade does not necessarily mean they are the "same" or congruent clade.
For the coming weeks, I hope to a nail down a concrete project that I can present at TDWG 2018. This will be something along the line of reconciling taxonomy and phylogenies, using open tree and clade definitions. Abstract submission is due on Mar 12.
- Retrieve clades from Open Tree based on clade definitions stored in Phyloregnum. Understand concept relationship.
- For the OTT-VTO matching project, drill down into the Scleropages problem. [Did not work on this]
- (Unfinished goal from previous week) Understand the nature of phylogenetic clade definitions (e.g., intensional or ostensive). Read Ghiselin, 1984, Ghiselin, 1995, and Rieppel, 2006. Skim Stanford & Kitcher
- Fully curate five papers into the Curation Workflow, with no outstanding test failures
- Add support for specimen matching to the Curation Workflow: phyloref/curation-workflow#8
- Add support for identifying unmatched specifiers in the Curation Workflow: phyloref/curation-workflow#9
- Create a working prototype of the Curation Tool
- Add support for editing specifiers to the Curation Tool
During last week, other than preparing the TDWG symposium abstract, I worked mainly on the OTT (open tree taxonomy) project. I formulated research questions and started exploring study cases to address these questions. For this week, I'd like to continue working on this project. I plan to focus on three groups, Scleropages fishes, one group of insects (probably Reduviidae, which I'm rather familiar), and Campanulaceae (or Campanula). I think this project will illustrate the problems of matching phylogeny with taxonomy and make a plead for phylogenetically defined names. I will also document how the findings may inform phyloreferencing software development.
- Outreach
- Get a review and publish the node-matching blog post
- Fully curate five papers into the Curation Workflow, with no outstanding test failures
- Add support for specimen matching to the Curation Workflow: phyloref/curation-workflow#8
- Started work on this in curation-workflow/specimen_matching, but ended up confusing myself. I think I've managed to unconfuse myself, so I should be able to implement this this week.
- Add support for identifying unmatched specifiers in the Curation Workflow: phyloref/curation-workflow#9
- Started work on this in curation-workflow/report_unmatched_tu and jphyloref/report_unmatched_tu, but requires better error reporting from the add-labels.py script before this is done.
- Add support for specimen matching to the Curation Workflow: phyloref/curation-workflow#8
- Create a working prototype of the Curation Tool
- Add support for editing specifiers to the Curation Tool
- Get a review of our Curation Tool software development plan, including the explanation for PhyloRegnum/Curation Tool integration yet, and when we should make this decision.
- Created an ObservableHQ shared notebook for displaying the MRCA subtree for a set of taxa. This will be useful to play around with right now and is a first step towards Open Tree integration later in this project.
Updates. Met with Nico and had a conference call with phyloref team. Discussed and clarified some issues regarding the design of use cases. It was agreed that outreach to Golife projects will be best done after we have a working tool and some worked out use cases. I will focus on further developing the Phenoscape project, i.e., mapping two different taxonomies, but will do so from a phyloref perspective. What I would also like to do is devise dummy use cases of phyloreferences, which should demonstrate the range of expected applications and issues. I prepared a TDWD 2018 symposium proposal. Other than those, I prioritized on a job phone interview last week, and I think it went pretty well.
Goals.
- Finalize TDWD symposium proposal. Contact potential speakers (if need to). 1 day
- For the Opentree-Phenoscape taxonomy mapping project, find one manageable but interesting fish group which fulfills the following criteria: 1) it has a phylogeny and/or the phylogeny has been updated, 2) that phylogeny has been used to study evolutionary questions, 3) taxonomic history of that group can be traced easily. Apply phyloreferences to this group. 2 days
- Finish writing the review on Berendsohn & Kennedy's approach on concept taxonomy. 2 days
- Understand the nature of phylogenetic clade definitions (intensional or ostensive). Read Ghiselin, 1984, Ghiselin, 1995, and Rieppel, 2006. Skim Stanford & Kitcher
- Outreach
- Publish the blog post from the Durham trip
- Finish and publish the node-matching blog post
- Ensure that we get a good proposals in to the call for symposia and workshops for the TDWG/SPNHC 2018 meeting, due January 29.
- Create a working prototype of the Curation Tool
- Start developing Curation Tool, version 0.1.
- Write up a position description on why we are not making a decision on PhyloRegnum/Curation Tool integration yet, and when we should make this decision.
- Fully curate five papers into the Curation Workflow, with no outstanding test failures
- Add support for specimen matching to the Curation Workflow: phyloref/curation-workflow#8
- Started work on this in curation-workflow/specimen_matching, but ended up confusing myself. I think I've managed to unconfuse myself, so I should be able to implement this this week.
- Add support for identifying unmatched specifiers in the Curation Workflow: phyloref/curation-workflow#9
- Started work on this in curation-workflow/report_unmatched_tu and jphyloref/report_unmatched_tu, but requires better error reporting from the add-labels.py script before this is done.
- Add support for specimen matching to the Curation Workflow: phyloref/curation-workflow#8
- Phone interview
- Continue working on the open tree taxonomy project developed at Phenoscape hackathon
- TDWG symposium proposal/abstract
- Get feedback on the blog post from the Durham trip.
- Respond to Hilmar's feedback on the curation workflow and the node-matching blog post.
- Add support for specimen matching to the Curation Workflow: phyloref/curation-workflow#8
- Started work on this in curation-workflow/specimen_matching, but ended up confusing myself. I think I've managed to unconfuse myself, so I should be able to implement this this week.
- Add support for identifying unmatched specifiers in the Curation Workflow: phyloref/curation-workflow#9
- Started work on this in curation-workflow/report_unmatched_tu and jphyloref/report_unmatched_tu, but requires better error reporting from the add-labels.py script before this is done.
- Start developing Curation Tool, version 0.1.
- Ensure that we start writing proposals for the call for symposia and workshops for the TDWG/SPNHC 2018 meeting, due January 24.
Updates:
-
I re-read some of Berendsohn's writings and also took a closer look at his two database models -- IOPI and MoReTax, which used concept taxonomy. The former is like a checklist, but occurrences of a name from different sources (taxonomic concepts, essentially) are curated separately and there are some concept relationship statements. It also links out to other databases, but only via name strings (not concepts). The second model describes concept relationships in a more sophisticated way and how the "transmittability of linked information" could be achieved (a kind of data integration, I suppose). I was in the process of summarizing these findings. I would like to do a bit more, but not too much, investigation on if and how these data models were used.
-
I started writing down my questions and thoughts about our project, but also more generally about taxonomy and biodiversity data integration. Link to the doc will be shared via email.
-
Read Introduction and some of the M&M of Franz et al's latest manuscript on reconciling two bird phylogenies, which feels more or less like another test case of running Euler/X. The authors claim at least two major conceptual advances: concept congruences at higher-level can be attained in spite of non-overlapping taxon sampling and diverging perspectives can be reconciled and verbalized, which according to the authors is a more powerful method/language for integrating evolving phylogenomic research.
This week's goals:
- Continue developing the concept taxonomy critique
- Develop a more thorough analysis on two particular issues: (1) why Berendsohn and Kennedy's concept taxonomy models did not get widely used in specimen databases? The history appears to be not well documented. I might have to approach Berendsohn himself for a direct conversation. Not a huge priority, though (meaning that the critique can still be developed even I fail to present a full historical analysis of Berendsohn's concept taxonomy). (2) For each of the perceived challenges facing concept taxonomy that I have outlined, I would like to expand the writing to something more substantial. What I have is somewhat superficial and constitute opinions, rather than a critique. More specifically, I will need to illustrate the challenges using examples, hypothetical or actual.
- Learn how to write on own computer and push commits to Github.
- Continue developing the list of questions I have about our project.
- Write the blog post on the Durham trip.
- Currently in review at https://github.com/phyloref/phyloref.github.io/pull/31
- Respond to Hilmar's feedback on the curation workflow and the node-matching blog post.
- Add support for specimen matching to the Curation Workflow: phyloref/curation-workflow#8
- Started work on this in curation-workflow/specimen_matching, but ended up confusing myself. I think I've managed to unconfuse myself, so I should be able to implement this this week.
- Add support for identifying unmatched specifiers in the Curation Workflow: phyloref/curation-workflow#9
- Started work on this in curation-workflow/report_unmatched_tu and jphyloref/report_unmatched_tu, but requires better error reporting from the add-labels.py script before this is done.
- Start developing Curation Tool, version 0.1.
- Figured out how to run the Phyloref website in Docker on Windows.
- Continue writing the critique of Concept Taxonomy, more specifically,
- Read and review several papers (Kennedy et al., 2005; Berendsohn) and survey databases that had or attempted to implement some aspects of Concept Taxonomy (Berendsohn's plant/bryophyte database, GBIF, Fishbase).
- Try to understand and write about why concept taxonomy sensu Berendsohn & Kennedy did not get adopted by major databases.
- Read Nico Franz's latest MS on reconciling two bird phylogenomic studies.
- Write down a list of questions that I have about the Phyloreferencing project.
- Learn how to use Github to manage projects. Watch this video
- Get feedback on my three pull requests, with the goal of getting the blog post published this weekend or early next week.
- Blog post on matching specifiers: phyloref/phyloref.github.io#28
- Big pull request on the Curation Workflow: phyloref/curation-workflow#7
- Updated the Java reasoner in line with the above two changes: phyloref/jphyloref#1
- Develop an initial design for the Curation Tool that can be reviewed by the other team members.
- Start developing Curation Tool, version 0.1.
- Add support for specimen matching to the Curation Workflow: phyloref/curation-workflow#8
- Add support for identifying unmatched specifiers in the Curation Workflow: phyloref/curation-workflow#9
Funded by the US National Science Foundation through collaborative grants DBI-1458484 and DBI-1458604. See Funding for details.