160 likes | 176 Views
Explore the integration of SciENcv and ORCiD profiles in the CTSAsearch system to improve visualizations and enhance search functionalities for a large network of relationships. Includes social network analysis, spatial constraints, and future entity inclusions.
E N D
Visualizing Virtual Communitiesin the VIVO Profile Ecosystem David Eichmann, Alexis Graves, SyedaMominaTabish, Colin Grove University of Iowa
Motivation • CTSAsearch statistics: • 80 institutions • 3,640,188 distinct person URIs • Est. 174,675 – 421,640 “real” people • 7,664,680 authorship relationships • 49,177,109 coauthorshippairs
Goals Improve the user experience with increasingly complex, large relationship networks Begin to model more than just words as text strings in user queries Begin to model more than just people in the visualizations
Enter SciENcv SciENcv is NIH’s system to help researchers ‘assemble the professional information needed for participation in federally funded research’ OHSU (as lead) and Iowa received a CTSA administrative supplement to explore integration of SciENcv, CTSAsearch and ORCiD
VIVO-ISF – SciENcv/ORCiD Integration Part of our charge was to enable authors of SciENcv and/or ORCiD profiles to have their profiles appear in CTSAsearch results Direct API connections into SciENcv are still pending, but we have demonstrated ingest of batch dumps of XML-formatted data
CTSAsearch – OpenVIVO integration We participated in the planning and deployment of OpenVIVO at FORCE11 this year in Portland The goal from our end was near-real-time (hourly!) integration of current OpenVIVO into CTSAsearch results
CTSAsearch – OpenVIVO integration • The required architectural changes included • Full automation of the refresh pipeline • Refactoring the Lucene index generators • Separate index now for each site • Site indices are merged into the aggregate index
CTSAsearch interface enhancements • Timeline visualization of papers citing specific grants and the papers citing those papers • Based upon Jim Onken’s (NIH) wish list in his plenary talk at VIVO2015 • Search for organization units in addition to people
CTSAsearch interface enhancements UMLS concept recognition in profiles and in query text Concept generalization/specialization links to allow for query refinement
CTSAsearch interface enhancements Social network analysis to identify communities within search results Filtering of search results based upon spatial constraints
First notion of distance: Spatial Twists
Spatial Twists • The reality on a spherical Earth: • Fortunately, PostgreSQL has GIS extensions! • point(x.long, x.lat) <@> point(y.long, y.lat)
Future Work Inclusion of additional entities in visualizations (e.g., papers) Increased focus on temporal constraints and animations Inclusion of additional entities in modeling and search (particularly clinical trials and involved personnel)
CTSAsearch Acknowledgements Current activity supported by NIH NCATS grant 3UL1TR000128-10S1 to OHSU Many discussions over the project with Melissa Haendel (OHSU) VIVO-ISF 1.6 D2RQ mapping done by Marijane White at OHSU
Questions? • CTSAsearch (for now): • http://research.icts.uiowa.edu/polyglot • david-eichmann@uiowa.edu