Skip navigation and jump directly to page content

 IU Trident Indiana University

Open Source Software

One of the critical ways in which PTI and IU impacts and enables scientific research at IU, in the State of Indiana, and the nation as a whole through the distribution of open source software. Such software also aids the development of the Indiana economy. These open source software products often are used to the benefit of the private sector in Indiana and the nation as a whole. The production of open source software also enhances IU's competitiveness for federal grant funding, aiding PTI and IU in bringing competitively awarded federal research monies into Indiana and creating new high quality jobs in Indiana.

The production of open source software also enhances the value of Indiana University's scholarly output, since the use of open source software facilitates reproduce ability and testability of scientific research.

A list of open source software products produced and currently distributed by PTI and its affiliated organizations is given below:

PTI Research Centers

Last Updated 11/05/15

Software Name URL Description PTI Center
Curbee Suite of microservices applied to research objects to enhance (curate) them for publication. Minimally, it includes validation and packaging, but can be extended to any services needed for quality curation. Data 2 Insight
SEAD Matchmaker Recommendation engine that acts as matchmaker between people, data, and repositories to identify the most appropriate repositories for publication and publishing of research data objects. Data 2 Insight
SEAD-IU Cloud Storage and publication option for SEAD users that relies on IU Scholarly Data Archive and its robust distributed storage with two copies of data made by default. Data 2 Insight
Komadu Provenance collection tool that can be added to an existing cyberinfrastructure for the purpose of collecting and visualizing data lineage (e.g., dataset derivation, versioning and so on).  Data 2 Insight
HathiTrust Research Center secure commons Suite of services provisioning computational analysis to the nearly 14 million digitized books (70% in copyright) in the HathiTrust digital library. Data 2 Insight
HTRC Data Capsules Secure environment for text analysis at scale of sensitive digitized content.  Data 2 Insight
Avalon Media System Open source system for providing online access to audio and video collections Data 2 Insight
PhotoCat2 Image cataloging tool Data 2 Insight
Variations 6.3 Digital music library system (update) Data 2 Insight
Variations FRBR Source Release Experimental discovery system for music metadata Data 2 Insight
Kuali OLE 1.6 Open Library Environment Data 2 Insight
RIS2N3 Converts RIS citation format to N3 format for loading into VIVO. Data 2 Insight
Phoebus Data movement service Data 2 Insight
XSP Network library Data 2 Insight
Karma The Karma tool is a standalone tool that can be added to existing cyberinfrastructure for purposes of collection and representation of provenance data. Karma utilizes a modular architecture that permits support for multiple instrumentation plugins that make it usable in different architectural settings. Data 2 Insight
NetKarma NetKarma allows researchers to see the exact state of the network and store configuration of the experiment and its slice. THe provenance of the data will be stored and visualized through a data portal. The provenance data can be used by the researcher to analyze their data, allow for the suspension and resumption of an experiment and provide a single reference to find the details and data collected in an experiment. Data 2 Insight
NetKarma plugin to Cytoscape   Provides visualization of metadata   Data 2 Insight
Sigiri Sigiri Job Scheduler provides a simple abstraction for interaction with heterogeneous resource managers spanning grid and cloud computing, and features that make the tool useful for the midscale physical or natural scientist. Data 2 Insight
XMC Cat XMC Cat is a metadata catalog that stores rich metadata describing data objects that are themselves stored in files, storage repositories, or on the web. Data 2 Insight

PTI Service and Cyberinfrastructure Centers

Software Name URL Description PTI Center
Open Science Grid middleware
Middleware (communication) software for Open Science Grid Research Technologies
XLiveCD Open source system to allow Microsoft Windows users to access Linux X-term applications and graphics Research Technologies
Apache Rave Open Social and W3C Widget-compliant Web portal software for building Science Gateways and component-based, standard compliant social networking sites. Rave is a collaboration between Hippo Software, SURFNet, Mitre Corp, and the IU Science Gateway Group. Research Technologies
Apache Airavata Tools for building scientific workflows and science gateways, wrapping scientific applications as secure network services, delivering distributed messages for collaborating applications, and registering online services and workflows. Research Technologies
Apache Airavata PGA Apache Airavata PHP reference implementation Research Technologies
SciGaP The SciGaP project develops science gateways as a service using Apache Airavata and related software. Research Technologies
GeoGateway GeoGateway is a data product search and analysis gateway for scientific discovery, field use, and disaster response Research Technologies
Lustre UID/GID Mapping and Shared Key code   Contribution to Lustre base open source distribution allows secure use of Lustre file system over wide area networks Research Technologies
Trinity RNA sequence assembly NCGAS
mlRho Population mutation rates NCGAS
CAFE Computational analysis of (gene) family evolution NCGAS
Heterogeneity test Coalescent test for differences in allele frequency distributions NCGAS
Perlymorphism A suite of population genetics tools for large DNA sequence datasets NCGAS

Inactive / Historical Open Source Software from PTI

Software Name URL Description PTI Center
GeneIndex Seeks words of various lengths specified by users NCGAS
fastDNAml Maximum likelihood inference of evolutionary trees NCGAS
Life Sciences DataBase Parsers Software to input data into IBM DiscoveryLink NCGAS
PENELOPE-MPI Open source but not distributed in US without permission from government. Parallel (supercomputer) software for studying radiation transport used in planning chemotherapy for brain cancer Research Technologies
Pubsonline Open source bibliographic management Research Technologies
SMBL (Simple Message Broker Library) Tool for message passing interface programming with distributed systems Research Technologies
High Performance Storage Systems (HPSS) connector for the Java framework   Contribution to HPSS base distribution to enable access via Java applets Research Technologies
Twister Twister is a software tool that supports faster execution of many data mining applications implemented as MapReduce programs. The tool extends the functionality of MapReduce, a distributed programming technique patented by Google for large-scale data processing in datacenter environments. Twister allows MapReduce to achieve higher performance, perform faster data transfers, and reduce the time it takes to process vast sets of data for data mining and machine learning applications. Digital Science Center
Fault Tolerant Hybrid Information Service (FTHPIS) FTHPIS is a hybrid grid information service supporting both the scalability of large amounts of relatively slowly varying data and a high performance rapidly updated information service for dynamic regions. This generic implementation supports two information service implementations: ExtendedUDDI and WS-Context Digital Science Center
mpiJava mpiJava is an object-oriented Java interface to the standard Message Passing Interface for parallel programming (MPI). It provides the functionality of MPI for programmers wishing to write parallel programs in the Java programming language. It is implemented as a set of Java wrappers to various native MPI packages. Digital Science Center
Global-MMCS The Global-MMCS Project tries to build a collaboration system, which integrates various services including videoconference, instant messaging and streaming, support multiple videoconferencing technologies and heterogeneous collaboration environments. Digital Science Center
NaradaBrokering NaradaBrokering is a content distribution infrastructure, and enables the development of secure, failure-resilient Systems. Digital Science Center
Carousel Community Grid for PDA project developed an environment supporting ubiquitous accessing to Community Grid systems from various small wireless devices such as Smartphone. Digital Science Center
HPJava HPJava is an environment for scientific and parallel programming using Java. It is based on an extended version of the Java language. Digital Science Center
Klatsch The purpose of the Klatsch framework is to provide an easy-to-program, flexibleinterface for exploring and analyzing feeds of social media data. Digital Science Center