Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
Presenter: Chen Li, PhD. Professor, Department of Computer Science, University of California Irvine
Abstract
Many data analytics projects have collaborators with complementary backgrounds, including biologists, bioinformaticians, computer scientists, and AI/ML experts. Many of them have limited experience to code, set up a computing infrastructure, and use MLmodels. Existing tools and services, such as email attachments, GitHub, and Google Drive are inefficient for sharing data and analyses. In this talk, we present an open source system called Texera that provides a cloud computing platform for collaborators to share data and analyses as workflows. After seven years of development, the system has a rich set of powerful features, such as shared editing, shared execution, version control, commenting, debugging, user-defined functions in multiple languages (e.g., Python, R, Java), and support of state-of-the-art AI/ML techniques. Its backend parallel engine enables scalable computation on large data sets using computing clusters. We will show a demo of the system, and present our vision supported by a recent NIH award, dkNET(NIDDK Information Network, https://dknet.org), to serve the diabetes, endocrinology, and metabolic diseases research communities through the FAIR sharing of data and knowledge.
Resource link: https://github.com/Texera/texera
Dial-in Information:
https://uchealth.zoom.us/meeting/register/tZMrcuuvrTgsHdaSU_sHRUiygD5_l5kOhbfq
Date/Time: Friday, April 26, 2024, 11 am - 12 pm PT
Upcoming webinars schedule: https://dknet.org/about/webinar