CaRCC Activities at PEARC21

Greetings to members of the Campus Research Computing (CaRCC) Communities!

The PEARC21 Virtual Conference (July 19-22) is almost here. Be sure to attend the many great workshops, tutorials, presentations, and panels from members of the community. Here are some of the CaRCC-related activities you won’t want to miss:

Town Hall

Panel: Campus Research Computing Consortium (CaRCC) Town Hall
Wednesday July 21st, 9:20 AM – 10:50 AM PDT
Thomas Cheatham, University of Utah; Dana Brunson, Internet2; Ruth Marinshaw, Stanford University; Lauren Michael, University of Wisconsin-Madison; Claire Mizumoto, UC San Diego; Patrick Schmitz, Semper Cogito; Scott Yockel, Harvard University

This panel will gather CaRCC leaders and community members to discuss recent products and significant activities that CaRCC has supported as well as new initiatives for 2021 and beyond. CaRCC is always interested to hear the concerns of our community and their ideas for what CaRCC (in partnership with other community organizations) can do to better support RCD Professionals.

RCD Capabilities Model

Workshop: Building a Strategic Plan for your Research Computing and Data Program
Monday July 19th, 8:00 am – 11:00 am (PT)
Dana Brunson, Internet2; Doug Jennewein, Arizona State University; Claire Mizumoto, UC San Diego; Patrick Schmitz, Semper Cogito; Scotty Strachan, University of Nevada, Reno

This workshop will gather Research Computing and Data (RCD) professionals to learn leading practices for developing effective strategic plans for their Research Computing and Data programs. Attendees will discuss the range of RCD strategic planning models across the community, and approaches to building a strong strategic planning practice.

Paper: Assessing the Landscape of Research Computing and Data Support
Tuesday July 20th, 9:50 AM – 10:20 AM PDT
Patrick Schmitz, Semper Cogito

We describe the first Research Computing and Data Capabilities Model Community Dataset, aggregating the assessments of 41 Higher Education Institutions. This Community Dataset provides insight into the current state of support for RCD across the community and in a number of key sub-communities. The dataset shows stark differences between public and private institutions and between institutions with larger and smaller shares of national funding, among other comparisons.

BoF: Assessing and communicating cyberinfrastructure readiness at EPSCoR and under-resourced institutions
Tuesday July 20th, 11:15 AM – 12:15 PM PDT
Scotty Strachan, University of Nevada, Reno; Venice Bayrd, Montana State University; Patrick Schmitz, Semper Cogito; Dana Brunson, Internet2; Pips Veazey, University of Maine; Gwen Jacobs, University of Hawai’i

This BoF, which is part of a series of EPSCoR-focused engagement events for 2020-2022, brings together RCD Capabilities Model Working Group members and research technology support personnel from NSF-EPSCoR institutions to review ongoing challenges in assessing and communicating institutional cyberinfrastructure (CI) readiness, discuss current Model engagement efforts, provide feedback on the Model development process, and brainstorm potential cooperative CI efforts across the EPSCoR program.

RCD Professionalization

BoF: Quantifying the Research Computing and Data Professional Community for Attracting, Retaining, and Diversifying RCD Professionals 
Wednesday, July 21 • 11:00am – 12:00pm
Christina Maimone (Northwestern University), Scott Yockel (Harvard University), Amy Neeser (UC Berkeley), Ashley Stauffer (Penn State University), Kimberly Grasch (University of Chicago), and Timothy Middelkoop (Internet2)

Participants in the National Science Foundation Virtual Workshop on the Research Innovation Workforce for Cyberinfrastructure identified recruiting and sustaining a diverse and inclusive workforce as a key challenge for the future of the RCD field, yet no systematic data is available on the current composition of the RCD workforce. A breakout group from this workshop joined the CaRCC RCD Professionalization Working Group and set out to better understand, quantitatively, the state of RCD staffing across the United States by designing, testing, and implementing an RCD workforce survey tool and using it to conduct a national survey. During the Birds of a Feather session, the working group will provide an overview of the current state of this work and solicit feedback from the audience on a number of topics.

Emerging Centers (People Network)

Workshop: Refining Your Research Computing Pitch
Monday, July 19 • 12:00pm – 3:00pm
Jane Combs, University of Cincinnati; Richard Knepper, Cornell University (Emerging Centers track coordinators)

This PEARC workshop is for professionals (center leaders, facilitators, faculty champions, etc.) of existing or emerging research computing organizations (even those “one-person shops”) to get feedback on and develop their communication materials. Through this workshop, we hope to provide a clearinghouse of template materials that will make campus outreach easier for research computing professionals and that participants can review and improve over time.

People Network Calls, April 2021

Greetings, everyone. Welcome to Spring and April… No Joke!

Please mark your calendars for these upcoming People Network remote Zoom meetings. For handy calendar entries please try the CaRCC Events calendar.

Data-Facing Track (first Tuesdays)

Digital Scholarship Platforms and Workflows – HathiTrust Research Center and Model of Models
Eleanor Koehl (HathiTrust) and Erin McCabe (University of Cincinnati)
Tuesday, April 6, 1p ET/ 12p CT/ 11a MT/ 10a PT/ 8a HT

HathiTrust is the largest non-profit digital library in the world, and roughly 2/3 of the collection is not available for human reading. This presentation will discuss how the HathiTrust Research Center leverages compute resources at Indiana University to make text data from the HathiTrust Digital Library available for text data mining. HTRC services attempt to meet the needs of scholars with a range of skill levels, who use a variety of research methods.

UC’s Digital Scholarship Center (DSC) has developed its own platform for text mining and visualization of large-scale unstructured language datasets. This presentation will provide a demonstration of the platform and review the topic modeling concepts that form its primary visualizations. We will also look at one or two analytical approaches to the platform’s output and cover some of the DSC’s work and challenges with data curation, parallel modeling, and working with researchers across disciplines.

Researcher-Facing Track (second Thursdays)

On Measuring the Impact of Training
Presentations by Kari Jordan (Carpentries), and Julie Wilson Rojewski and Astri Briliyanti, CyberAmbassadors
Thursday, April 8th, 1p ET/ 12p CT/ 11a MT/ 10a PT/ 8a HT

On previous Researcher-Facing calls, we’ve had the opportunity to discuss topics relating to measuring impact and improving training. As discussed in the Leading Practices of Facilitation, “training & education” is one of the major pillars of our efforts. Many of us provide training opportunities and struggle to define and measure “impact” or “success”: is it short-term gains (quality scores for the class and instructors, reduced support burden, and acclimating users), long-term considerations (effectiveness of training programs, building relationships, promoting awareness and participation), or does it depend on the kind of training (professional skills, technical topics)? Or are we confounding these, complicating both the objectives and outcomes?

April’s call will showcase two “case studies” of measuring training impact, in which each presenter will talk about their program, define “impact”, and explain their approach to measuring it. Please also join us by contributing to our pre-talk survey: What challenges do you currently face in measuring training impact? And what successful strategies have you tried?

Emerging Centers Track (third Wednesdays)

New Resources Available to the National Research Community: Jetstream 2, Bridges 2, and Anvil
Wednesday, April 21st, 12pm ET/ 11am CT/ 10am MT/ 9am PT/ 7am HT

Representatives from Indiana University, Pittsburgh Supercomputing Center, and Purdue University will discuss the new systems at each of these sites and their capabilities offered to the national research community. All of these resources will be available to researchers nationwide via the XSEDE project allocations system.

Systems-Facing Track (third Thursdays)

Experiences and Advice for Large and Small Data Centers – Cooling
Thursday, April 15th, 1p ET/ 12p CT/ 11a MT/ 10a PT/ 8a HT

Our panel will discuss experiences managing cooling (heat) in data centers – from large-scale systems to clusters in closets. In brief presentations, panelists will share their experiences designing and managing cooling for their infrastructure, then take questions from participants and discuss options. Any questions about power or security will be collated for a future session.

General Track Call Information

Interested participants need not subscribe to a particular track to participate in calls. However, additional details for track members, including notes documents and any pre-call activities, will be distributed ahead of the call via the email lists and other communication channels within each track.

All calls will take place within the same Zoom room distributed via email. Please join the People Network (link just above) or contact us for details.

The CaRCC People Network aims “to foster, build and grow an inclusive community (termed the ‘People Network’) for campus CI, research computing and data professionals.” If you did not receive this information via CaRCC’s People Network email list and you would like to join the People Network – Researcher-Facing, Data-Facing, Systems-Facing, Emerging-Centers, and other tracks – please fill in our Join the People Network form.