Announcing the V1.1 Release of the RCD Capabilities Model

Internet2, CaRCC, and EDUCAUSE are pleased to announce that there is a new version of the Research Computing and Data Capabilities Model (RCD CM) just in time for you to complete your Institutional assessment and contribute to the 2021 RCD CM Community Dataset!

The Research Computing and Data Capabilities Model allows institutions to assess their support for computationally- and data-intensive research, to identify potential areas for improvement, and to understand how the broader community views Research Computing and Data support.

Continue reading “Announcing the V1.1 Release of the RCD Capabilities Model”

2021 RCD Capabilities Model Office Hours

Have questions about how to get started with the Research Computing and Data Capabilities Model? Or are you already working with it and just want to discuss the process, or a particular aspect of the assessment tool? Join working group members and your colleagues in the community at one of our upcoming Office Hours to get help, ask your questions, and share your experiences!

Office Hours for 2021 are scheduled for the:

2nd Wednesdays of each month for 1 hour at 11am ET (new)

4th Tuesday of each month for 1 hour, at 2 pm ET.

Update: We’ve added an extra session on September 14th to help with questions leading up to the 2021 Community Dataset submission deadline (at the end of September).

Those dates are:

  • April 27
  • May 25
  • June 22
  • July 14 and 27
  • August 11 and 24
  • September 8 and 28 plus extra hour on September 14, 2pm ET
  • October 13 and 26
  • November 10 and 23

For details on how to join, email the RCD CM working group at capsmodel-help@carcc.org, or subscribe to the capsmodel-discuss@carcc.org discussion list.

People Network Calls, April 2021

Greetings, everyone. Welcome to Spring and April… No Joke!

Please mark your calendars for these upcoming People Network remote Zoom meetings. For handy calendar entries please try the CaRCC Events calendar.

Data-Facing Track (first Tuesdays)

Digital Scholarship Platforms and Workflows – HathiTrust Research Center and Model of Models
Eleanor Koehl (HathiTrust) and Erin McCabe (University of Cincinnati)
Tuesday, April 6, 1p ET/ 12p CT/ 11a MT/ 10a PT/ 8a HT

HathiTrust is the largest non-profit digital library in the world, and roughly 2/3 of the collection is not available for human reading. This presentation will discuss how the HathiTrust Research Center leverages compute resources at Indiana University to make text data from the HathiTrust Digital Library available for text data mining. HTRC services attempt to meet the needs of scholars with a range of skill levels, who use a variety of research methods.

UC’s Digital Scholarship Center (DSC) has developed its own platform for text mining and visualization of large-scale unstructured language datasets. This presentation will provide a demonstration of the platform in addition to reviewing topic modeling concepts that form its primary visualizations. Additionally, we will look at 1-2 analytical approaches to the platform’s output, as well as cover some of the DSC’s work / challenges with data curation, parallel modeling, and working with researchers across disciplines. 

Researcher-Facing Track (second Thursdays)

On Measuring the Impact of Training
Presentations by Kari Jordan (Carpentries), and Julie Wilson Rojewski and Astri Briliyanti, CyberAmbassadors
Thurs, April 8th, 1p ET/ 12p CT/ 11a MT/ 10a PT/ 8a HT

On previous Researcher-Facing calls, we’ve had the opportunity to discuss topics relating to measuring impact and improving training. And as discussed in the Leading Practices of Facilitation, “training & education” is one of the major pillars of our efforts. Many of us provide training opportunities and struggle to define and measure “impact” or “success” — is it short terms gains (quality scores for the class & instructors, reduced support burden, and acclimating users), long term considerations (effectiveness of training programs, building relationships, promoting awareness and participation), and does it depend on the kind of training (professional skills, technical topics)?  Or are we confounding these, complicating both the objectives and outcomes?

April’s call will showcase two “case studies” of measuring training impact, where each presenter will talk about their programs, define “impact”, and explain their approach to measuring this. Please also join us by contributing to our pre-talk survey: What challenges do you currently face in measuring training impact? And what successful strategies have you tried?

Emerging Centers Track (third Wednesdays)

New Resources Available to the National Research Community: Jetstream 2, Bridges 2, and Anvil
Wednesday, April 21st, 12pm ET/ 11am CT/ 10am MT/ 9am PT/ 7am HT

Representatives from Indiana University, Pittsburgh Supercomputing Center, and Purdue University will discuss the new systems at each of these sites and their capabilities offered to the national research community. All of these resources will be available to researchers nationwide via the XSEDE project allocations system.

Systems-Facing Track (third Thursdays)

Experiences and Advice for Large and Small Data Centers – Cooling
Thursday April 15th, 1p ET/ 12p CT/ 11a MT/ 10a PT/ 8a HT  

Our panel will discuss experiences managing cooling (heat) in data centers – from large scale systems to clusters in closets. The brief presentations will include experiences designing and managing cooling for their infrastructure. Our panelists will take questions from participants and discuss options. Any questions about power/security will be collated for a future session.

General Track Call Information

Interested participants need not subscribe to a particular track to participate in calls. However, additional details for track members, including notes documents and any pre-call activities, will be distributed ahead of the call via the email lists and other communication channels within each track.

All calls will take place within the same Zoom room distributed via email. Please join the People Network (link just above) or contact help@carcc.org for details.

The CaRCC People Network aims “to foster, build and grow an inclusive community (termed the “People Network”) for campus CI, research computing and data professionals.” If you have received this information NOT via CaRCC’s People Network email list and you would like to join the People Network – Researcher-Facing, Data-Facing, Systems-Facing, Emerging-Centers, and other tracks – please fill in our Join the People Network form.

Join us for March People Network Calls

Mark your calendars for these upcoming People Network virtual meetings. (For handy calendar entries, try the CaRCC Events Calendar.)

Plenary session hosted by the Data-Facing Track (first Tuesdays)

Tuesday, March 2, 1p ET/ 12p CT/ 11a MT/ 10a PT

PLENARY: Using Data to Benchmark your Research Computing and Data Program: The RCD Capabilities Model and Community Dataset

Presenters: Claire Mizumoto, UC San Diego & Patrick Schmitz, Semper Cognito Consulting

Join us during the regular data-facing slot for the first (and hopefully not last) People Network plenary session! Claire Mizumoto and Patrick Schmitz from the Capabilities Model working group will present the results from the first community dataset. These assessments were completed using the 1.0 version of the Research Computing and Data Capabilities Model (RCD CM), over a period of several months in the Spring and Summer of 2020. This Community Dataset provides insight into the current state of support for RCD across the community and in a number of key sub-communities.

Across science, engineering, social sciences, and the humanities, every university depends upon research computing and data (RCD) professionals and infrastructure. The rapid evolution and diversification of RCD infrastructure, services, and support poses significant challenges to academic institutions as they try to effectively assess and plan for the growing needs of researchers. Many institutions would also like to assess their capabilities in comparison to peers. The lack of a shared vocabulary to describe the various aspects of RCD support hinders coordinated efforts to advance support of and for researchers. These challenges are especially acute for smaller and emerging RCD support organizations, which often lack experience supporting RCD and have limited resources to develop an analysis framework for strategic planning. To address these gaps, a collaborative team developed a Research Computing and Data Capabilities Model that allows an organization to self-evaluate across a range of RCD services. The Model provides structured input to guide strategic planning, leveraging a defined and shared community vocabulary and enabling benchmarking relative to peer institutions.

Researcher-Facing Track (second Thursdays)

Thursday, March 11, 1pm ET (12pm CT/11am MT/10am PT)

All About Orienting Researchers to Research Computing + Data Resources

Continue reading “Join us for March People Network Calls”

2020 RCD CM Community Dataset report available

Scatter Plot showing the capabilities coverage for all 41 institutions

The report describes the first Research Computing and Data Capabilities Model Community Dataset, aggregating the assessments of 41 Higher Education Institutions. These assessments were completed using the 1.0 version of the Research Computing and Data Capabilities Model (RCD CM), over a period of several months in the Spring and Summer of 2020. This Community Dataset provides insight into the current state of support for RCD across the community and in a number of key sub-communities. Download it now! (zenodo.org/record/4344057).

Update August 2021: See also our PEARC21 paper (which won the Best Full Paper award for the Workforce Development, Training, Diversity, & Education track): Assessing the Landscape of Research Computing and Data Support: The 2020 RCD Capabilities Model Community Dataset (also available here if you don’t have ACM access).

The Research Computing and Data Capabilities Model allows institutions to assess their support for computationally- and data-intensive research, to identify potential areas for improvement, and to understand how the broader community views Research Computing and Data support. The Model was developed by a diverse group of institutions with a range of support models, in a collaboration among Internet2, CaRCC, and EDUCAUSE. This Assessment Tool is designed for use by a range of roles at each institution, from front-line support through campus leadership, and is intended to be inclusive across small and large, and public and private institutions. 

We encourage you to check out the Capabilities Model, and begin to use it at your institution. Start with the Capabilities Model Introduction and Guide to Use, which includes background as well as tips for using the model, and a link to the access request form that will create a personalized copy of the Assessment Tool for your institution.  You can also watch the recording of the EDUCAUSE webinar. Keep an eye on the RCD CM working group page for more information and updates.

Welcome to February 2021!

Please see below for People Network calls this month, and make sure to join in! (For handy calendar entries, see the CaRCC Events Calendar.) 

Data-Facing Track (first Tuesdays)

Tuesday, February 2, 1p ET/ 12p CT/ 11a MT/ 10a PT

Casual Tuesday Community Roundtable

We want to take this month to hear a bit about what everyone is up to. This session will be a general sharing and free-form brainstorming session. We’d love to hear 3-5 minutes about projects that people are working on currently or new developments. If you’re stuck on something, feel free to bring that forward and get advice from the brain trust. We can also make breakout rooms for deeper discussions that arise.

Researcher-Facing Track (second Thursdays)

Thursday, February 11, 1pm ET (12pm CT/11am MT/10am PT)

Supporting Researchers with Containers

Continue reading “Welcome to February 2021!”

People Network Calls in the New Year

We hope you have a wonderful holiday season and are able to join People Network calls in January! (For handy calendar entries, see the CaRCC Events Calendar.) 

Data-Facing Track (first Tuesdays)

Tuesday, January 5, 1p ET/ 12p CT/ 11a MT/ 10a PT

Python for Big Data

Presenter: Bala Desinghu, Rutgers University

Python is a popular programming language for developing software and data science applications. Its popularity stems from many factors such as simplicity, readability, portability, etc. As such, Python is slow compared to C or Fortran and it does not manage memory well. These limitations in speed and memory management may not be significant when analyzing small data sets, but it becomes a bottleneck when analyzing big data sets. Techniques based on vectorization, parallelization, just in time compilation, and distributed task executions have been widely adopted by the Python community to address these challenges associated with big data. This presentation will address a few techniques suitable for large scale data analysis and answer the following questions: What to do when the data set size exceeds the available physical memory? How to speed up the data analysis? How to distribute the workloads when doing machine learning for big data sets?

Researcher-Facing Track (second Thursdays)

Thursday, January 14, 1p ET/ 12p CT/ 11a MT/ 10a PT

All about CaRCC (… beyond the R-F Track)

Presenters:
Tom Cheatham, University of Utah
Lauren Michael, University of Wisconsin
Dana Brunson, Internet2
Patrick Schmitz, Semper Cogito Consulting

Continue reading “People Network Calls in the New Year”

Join the December People Network Calls, including a Party!

We’ll have just three community calls in December (to avoid conflicts with winter vacation plans), including a cross-network party! (For handy calendar entries, see the CaRCC Events Calendar.) 

CaRCC End-Of-Year Party

Thursday, December 10, 1-2:30pm ET / 12-1:30pm CT / 11am-12:30pm MT / 10-11:30am PT

We’ll use the usual Zoom room, and Zoom’s new support for self-select breakout rooms, with rooms designated for following topics: Main Room: Greetings, Hors D’oeuvres, and Games; SC After-Party; People Network Brainstorm (share your ideas!); All About CaRCC Working and Interest Groups.

Make sure you’ve updated your Zoom client since September 21 (to version 5.3.0 or higher).

Data-Facing Track (first Tuesdays)

Tuesday, December 1, 1p ET/ 12p CT/ 11a MT/ 10a PT

Creating a standard Vocabulary: the Unified Medical Language System (UMLS)

David Anderson from the National Library of Medicine will present on the Unified Medical Language System (UMLS). The UMLS is a set of files and software that brings together many health and biomedical vocabularies and standards to enable interoperability between computer systems. David will provide a basic overview of the UMLS including its history and use cases.

Emerging Centers Track (third Wednesdays)

Wednesday, December 16, 12p ET/ 11a CT/ 10a MT/ 9a PT

NSF Program Directors – Campus Cyberinfrastructure (CC*) NSF 21-528

You will not want to miss this call! The NSF CC* program is a perfect program for emerging centers to begin developing their research partnerships and center resources.

Kevin L. Thompson, NSF CISE/OAC and Deepankar (Deep) Medhi, NSF CISE/CNS, will share information about the recently released CC* Program solicitation and answer any questions you may have.  Jen Schopf, Director of EPOC, will then answer some commonly asked questions that EPOC has heard from previous submitters to the CC* program.  She will briefly explain how the EPOC program experts can assist you in the development of your proposal.   

We encourage previous recipients of the award to share any lessons learned or tips and tricks for developing a successful proposal.

Continue reading “Join the December People Network Calls, including a Party!”