What data to keep? — Making decisions about confocal microscopy data
Presenters: Huajin Wang, Librarian/Program Director for Open Science & Data Collaborations, Carnegie Mellon University Libraries, and Susan Ivey, Research Data & Infrastructure Librarian, NC State University Libraries
As the quantity and volume of data produced by research increase exponentially, it has become increasingly challenging to preserve and reproduce data. Traditionally, researchers have often created their own workflows and data storage solutions, but this is no longer sustainable and makes collaboration and data sharing difficult. Data librarians, for their part, are tasked with helping researchers share and preserve their data, but understanding specific types of data and how to maximize their reuse can be difficult. Large and complex data exist in a variety of disciplinary areas; one example is confocal microscopy data. In April 2019, the Data Curation Network held its 2nd Data Curators Workshop at Johns Hopkins University. Susan Ivey, Amy Koshoffer, Gretchen Sneff, and Huajin Wang formed a group to address many of the issues associated with confocal microscopy data. During this July’s Data-Facing Call, we’ll go into detail about common workflows and challenges that researchers face when working with confocal microscopy data and give an overview of our “Confocal Microscopy Data: A Primer for Curators,” which we created to help those tasked with curating this type of data. We’ll also present some of the use cases that informed this work and invite the audience to think about how best to preserve and share these data.
Topic: How are we doing? A discussion on philosophies/culture, approaches, and tools for understanding creation of knowledge, metrics, and impact
Our recent calls, from handling support requests via various tools/modes, to remote work for support/consultations and training, to working with your (remote) team in this unprecedented time, have been an unexpected but fruitful journey. This month we close the loop: looking across your team’s overall efforts, both internal- and external-facing, and the processes and tools behind them, what is your philosophy on measuring your activities and impact, and how are you doing it? What information do you gather around internal- or researcher-facing activities? Are you using approaches or tools that harness NLP, ML, or predictive analytics? And do you have specific goals that you strive towards? Join us and share.
Wednesday, May 20, 12pm ET / 11am CT / 10am MT / 9am PT / 7am HT
Note: The Data-Facing and Systems-Facing calls will NOT happen at their normal times in May; please join the joint call above instead.
Join us for a community panel on effectively incorporating student workers into research computing and data groups. Topics will include: hiring, development, structuring student positions, work assignments, challenges, managing remote work, and training. Bring your questions for the panelists.
Panelists: Amy Neeser (University of California Berkeley); Tony Elam (University of Kentucky); Colby Witherup Wood and Alper Kinaci (Northwestern University); Amy Work and Stephanie Labou (University of California San Diego); Betsy Hillery (Purdue University); Joanne Luciano (University of the Virgin Islands); Brian Haymore (University of Utah); Troy Baer (OSC)
Description: Let’s continue the discussion around remote support and how you make it work. This month, we’ll focus on how you work with your team – how have your collaboration objectives with your team shifted given our work-at-home reality? How effective have they been? What would you do differently? And what has surprised you? Add your comments and questions to the call document in advance of the start of the meeting or anytime throughout the call!
Have questions about how to get started with the Research Computing and Data Capabilities Model? Or are you already working with it and just want to discuss the process, or a particular aspect of the assessment tool? Join working group members at one of our upcoming Office Hours to get help, ask your questions, and share your experiences! The next few Office Hours are scheduled for:
Data-facing work in the XSEDE Extended Collaborative Support Service (ECSS), Sergiu Sanielevici (PSC)
Description: The Extended Collaborative Support Service (ECSS) improves the productivity of the Extreme Science and Engineering Discovery Environment (XSEDE) user community through collaborations to optimize their applications and their work and data flows, and engages practitioners of disciplines that have not traditionally used advanced cyberinfrastructure (ACI). Novel & Innovative Projects (NIP) has the primary responsibility within ECSS for this latter task. NIP provides mentoring to help projects succeed and advice on the use of technologies such as virtual environments, machine learning, virtualization, and containers. NIP is now focused on helping AI and “big data” projects on novel SP resources scheduled to enter production in 2020, including Bridges-2 at PSC and Expanse at SDSC.
Doing it in public: User support via Slack, open forums, GitHub repos, and blogs, a panel and community discussion
Description: To address the increasing number of support requests in research computing, we need strategies that amplify our support efforts. One such strategy is to provide support in public environments like Slack, open forums, GitHub repos, and blogs, which allows consultants to answer a question once and encourages researchers to support each other. Please join us as we explore the benefits and pitfalls of this approach.
Help identify and prioritize topics for future calls! This month’s call will be an open forum to discuss the direction of the track and brainstorm call topic ideas. What would you like to see more (or less) of? Who can we partner with or add to the conversation?
Data librarians may become “data science” librarians by necessity, if they are not already serving in this role. In this call, we will provide an overview of the particulars of two libraries’ data science support beyond the library (e.g., across departments, collaborating with IT, etc.) and have an open discussion of how other groups are dealing with similar support needs.
JANUARY 7 – Identify/prioritize topics for the coming year!
Sharing on behalf of the PEARC20 Program Committee:
Deadlines: (Updated Jan 8)
January 22nd: Tutorial submissions due
January 22nd: Workshop submissions due
February 17th: Technical track full paper submissions due
February 17th: Lightning talk abstract submissions due
February 24th: Student technical track full paper submissions due
April 24th: Poster submissions due
April 24th: Student poster submissions due
May 1st: Panel submissions due
May 1st: BOF submissions due
May 1st: Viz Showcase submissions due
May 15th: All camera-ready submissions due
PEARC20 will explore the current Practice and Experience in Advanced Research Computing, including modeling, simulation, and data-intensive computing. PEARC20 will be in Portland, OR from July 26th-30th, 2020. This year’s theme, “Catch the Wave,” embodies the spirit of the community’s drive to stay in front of the new waves in technology, analytics, data, visualization, and a globally connected and diverse workforce.
PEARC20 brings together community thought leaders, CI professionals, and students to learn, share ideas, and craft the infrastructure of the future. The PEARC20 student program will provide students with a range of opportunities to participate in both student activities and the full technical program so that they may share their research efforts and gain insights and inspiration from like-minded individuals at the conference.
The session will include a series of lightning-round case studies from a variety of institutions whose libraries and IT departments are collaborating. This will be followed by a discussion about these approaches and collaboration more broadly.