Knowledge Base Resources

Use these links “vetted” by the community. Additional CI links are always welcome.

Paraview UArizona HPC links (beginner)

These links take you to visualization resources supported by the University of Arizona's HPC visualization consultant (rtdatavis.github.io). The following links are specific to the Paraview program and the workflows that have been used my researchers at the U of Arizona. Some of the pages linked are very beginner friendly: getting started, working with cameras and keyframes for rendering, visualizing external files (netcdf climate data), graphs and data exporting. Many of the workflows involve using remote desktops via the Open On Demand interface, but if this isn't set up at your university you can use paraview locally on a desktop. Feel free to post on access ci https://ask.cyberinfrastructure.org/ if you need assistance getting a paraview gui open for your work on HPC.

visualization

0 Likes

Type

documentation

Level

Flag as

OpenMP Tutorial

https://www.openmp.org/resources/tutorials-articles/

OpenMP (Open Multi-Processing) is an API that supports multi-platform shared-memory multiprocessing programming in C, C++, and Fortran on many platforms, instruction-set architectures and operating systems, including Solaris, AIX, FreeBSD, HP-UX, Linux, macOS, and Windows. It consists of a set of compiler directives, library routines, and environment variables that influence run-time behavior.

parallelization c c++fortran programming

0 Likes

Type

learning

Level

Flag as

Introduction to MP

A “Hands-on” Introduction to OpenMP*

Open Multi-Processing, is an API designed to simplify the integration of parallelism in software development, particularly for applications running on multi-core processors and shared-memory systems. It is an important resource as it goes over what openMP and ways to work with it. It is especially important because it provides a straightforward way to express parallelism in code through pragma directives, making it easier to create parallel regions, parallelize loops, and define critical sections. The key benefit of OpenMP lies in its ease of use, automatic thread management, and portability across various compilers and platforms. For app development, especially in the context of mobile or desktop applications, OpenMP can enhance performance by leveraging the capabilities of modern multi-core processors. By parallelizing computationally intensive tasks, such as image processing, data analysis, or simulations, apps can run faster and more efficiently, providing a smoother user experience and taking full advantage of the available hardware resources. OpenMP's scalability allows apps to adapt to different hardware configurations, making it a valuable tool for developers aiming to optimize their software for a range of devices and platforms.

expanse faster c c++compiling openmp programming

0 Likes

Type

presentation

Level

Flag as

Probabilistic Semantic Data Association for Collaborative Human-Robot Sensing

Probabilistic Semantic Data Association for Collaborative Human-Robot Sensing

Humans cannot always be treated as oracles for collaborative sensing. Robots thus need to maintain beliefs over unknown world states when receiving semantic data from humans, as well as account for possible discrepancies between human-provided data and these beliefs. To this end, this paper introduces the problem of semantic data association (SDA) in relation to conventional data association problems for sensor fusion. It then, develops a novel probabilistic semantic data association (PSDA) algorithm to rigorously address SDA in general settings. Simulations of a multi-object search task show that PSDA enables robust collaborative state estimation under a wide range of conditions.

ai machine-learning

0 Likes

Type

documentation

Level

Flag as

High performance computing 101

High performance computing 101

An introductory guide to High Performance Computing.

administering-hpc

0 Likes

Type

website

Level

Flag as

ConnectCI

https://cnct.ci

Connect.Cybinfrastructure is a family of portals, each representing a program that is serving a segment of the research computing and data community. Each portal provides program-specific information, as well a custom "view" into a common database. The portal was originally developed to support project workflows and a knowledge base of self service learning resources for the Northeast Cyberteam. Subsequently, it was expanded to provide support to multiple cyberteams and other research computing communities of practice. We welcome additional communities, please contact us if you are interested in participating. Central to the Portal is an extensive and ever-evolving tagging infrastructure which informs every aspect of the Portal. The tag taxonomy was initially developed by the Northeast Cyberteam to categorize subject matter relevant to practitioners of Research Computing Facilitation and is ever changing due to the frequent introduction of new technology in domains that characterize the field of research computing.

community-outreach

0 Likes

Type

website

Level

Flag as

Regular Expressions

Regular expressions (sometimes referred to as RegEx) is an incredibly powerful tool that is used to define string patterns for "find" or "find and replace" operations on strings, or for input validation. Regular Expressions are used in search engines, in search and replace dialogs of word processors and text editors, and text-processing Linux utilities such as sed and awk. They are supported in many programming languages, including Python, R, Perl, Java, and others.

perl programming python r

0 Likes

Type

learning

Level

Flag as

GDAL Multi-threading

GDAL Multi-threading

Multi-threading guidance when using GDAL.

parallelization gis

0 Likes

Type

learning

Level

Flag as

DELTA Introductory Video

DELTA Youtube Video

Introductory video about DELTA. Speaker Tim Boerner, Senior Assistant Director, NCSA

delta gpu training

0 Likes

Type

video

Level

Flag as

Use Windows Subsystem for Linux for HPC Command Line Access from Windows

Install Linux on Windows with WSL

Windows Subsystem for Linux (WSL) provides a Linux environment for Windows users to access HPC resources fast and efficiently.

workflow ssh

0 Likes

Type

tool

Level

Flag as

What are LSTMs?

Introduction to LSTMs

This reading will explain what a long short-term memory neural network is. LSTMs are a type of neural networks that rely on both past and present data to make decisions about future data. It relies on loops back to previous data to make such decisions. This makes LSTMs very good for predicting time-dependent behavior.

ai deep-learning machine-learning neural-networks

0 Likes

Type

learning

Level

Flag as

Metadata Systems

Metadata Systems

Metadata is a vital topic in libraries and librarianship, encompassing structured information used for accessing digital resources. The definition of metadata varies but is essentially data about data. It has evolved beyond simply describing metadata schemas and now focuses on topics like interoperability, non-descriptive metadata (administrative and preservation metadata), and the effective application of metadata schemas for user discovery. Interoperability, the ability to seamlessly exchange metadata between systems, is a major concern. Different levels of interoperability are examined, including schema-level, record-level, and repository-level. Challenges to interoperability include variations in standards, collaboration barriers, and costs.Metadata management is discussed in terms of the holistic management of metadata across an entire library. Steps include analyzing metadata requirements, adopting schema, creating metadata content, delivery/access, evaluation, and maintenance. Administrative metadata, which encompasses ownership and production information, is becoming more critical, particularly for electronic resource licensing. Preservation metadata is also gaining importance in ensuring the long-term viability of digital objects.

metadata

0 Likes

Type

learning

Level

Flag as

Horovod: Distributed deep learning training framework

Horovod

Horovod is a distributed deep learning training framework. Using horovod, a single-GPU training script can be scaled to train across many GPUs in parallel. The library supports popular deep learning framework such as TensorFlow, Keras, PyTorch, and Apache MXNet.

deep-learning distributed-computing gpu

0 Likes

Type

tool

Level

Flag as

Docker Tutorial for Beginners

Docker Tutorial for Beginners

A Docker tutorial for beginners is a course that teaches the basics of Docker, a containerization platform that allows you to package your application and its dependencies into a standardized unit for development, shipment, and deployment.

docker

0 Likes

Type

video_link

Level

Flag as

RRCoP Resources Page

RRCoP External resources Page

Very helpful list of Regulated Research Community of Practice's collaborating communities.

community-outreach cybersecurity

0 Likes

Type

website

Level

Flag as

Conda

Conda Tutorial

Conda is a popular package management system. This tutorial introduces you to Conda and walks you through managing Python, your environment, and packages.

anaconda conda python

0 Likes

Type

tool

Level

Flag as

Applications of Machine Learning in Engineering and Parameter Tuning Tutorial

Applications of ML in Engineering and Parameter Tuning Tutorial (RMACC 2019)

Slides for a tutorial on Machine Learning applications in Engineering and parameter tuning given at the RMACC conference 2019.

data-analysis machine-learning python

0 Likes

Type

learning

Level

Flag as

Thrust resources

Thrust is a CUDA library that optimizes parallelization on the GPU for you. The Thrust tutorial is great for beginners. The documentation is helpful for anyone using Thrust.

parallelization gpu resources

0 Likes

Type

learning

Level

Flag as

Slurm Tutorials

Slurm Tutorials

Introduction to the Slurm Workload Manager for users and system administrators, plus some material for Slurm programmers.

administering-hpc cluster-management hpc-cluster-architecture training

0 Likes

Type

learning

Level

Flag as

Astronomy data analysis with astropy

astropy

Astropy is a community-driven package that offers core functionalities needed for astrophysical computations and data analysis. From coordinate transformations to time and date handling, unit conversions, and cosmological calculations, Astropy ensures that astronomers can focus on their research without getting bogged down by the intricacies of programming. This guide walks you through practical usage of astropy from CCD data reduction to computing galactic orbits of stars.

visualization image-processing astrophysics

0 Likes

Type

learning

Level

Flag as

Recommended Libraries for Cyberinfrastructure Users Developing Jupyter Notebooks

Recommended Libraries for Cyberinfrastructure Users Developing Jupyter Notebooks

This repository contains information about Jupyter Widgets and how they can be used to develop interactive workflows, data dashboards, and web applications that can be run on HPC systems and science gateways. Easy to build web applications are not only useful for scientists. They can also be used by software engineers and system admins who want to quickly create tools tools for file management and more!

0 Likes

Type

website

Level

Flag as

Research Software Engineering Training Materials

Training Links

An ongoing collection of RSE training material, workshops, and resources. We are compiling this list as a starting point for future activities. We are especially seeking material that goes beyond basic research computing competency (e.g. what The Carpentries does so well) and is general enough to span multiple domains. Specific tools and technologies used only in one domain, or applicable to only one subset of computing (i.e. HPC) are typically too narrowly focused. When in doubt, submit it to be included or reach out and we’d be happy to discuss.

0 Likes

Type

website

Level

Flag as

Neocortex Documentation

Neocortex Documentation

Neocortex is a new supercomputing cluster at the Pittsburgh Supercomputing Center (PSC) that features groundbreaking AI hardware from Cerebras Systems.

documentation ai deep-learning neural-networks hardware

0 Likes

Type

documentation

Level

Flag as

Better Scientific Software (BSSw)

The Better Scientific Software (BSSw) project provides a community to collaborate and learn about best practices in scientific software development. Software—the foundation of discovery in computational science & engineering—faces increasing complexity in computational models and computer architectures. BSSw provides a central hub for the community to address pressing challenges in software productivity, quality, and sustainability.

community-outreach project-management research-facilitation workforce-development

0 Likes

Type

website

Level

Flag as

Introduction to Probabilistic Graphical Models

https://ermongroup.github.io/cs228-notes/

This website summarizes the notes of Stanford's introductory course on probabilistic graphical models. It starts from the very basics and concludes by explaining from first principles the variational auto-encoder, an important probabilistic model that is also one of the most influential recent results in deep learning.

ai machine-learning

0 Likes

Type

learning

Level

Flag as

Campus Champions

Knowledge Base Resources

Topics

Programming Language

Science Domain

Skill Level

Content Type