Author Archives: Alice Allen

Open Digital Infrastructure in Astrophysics

I spent two days last week at the Open Digital Infrastructure in Astrophysics meeting at the Kavli Institute for Theoretical Physics (KITP) at UC Santa Barbara. This meeting featured presentations on open-knowledge digital infrastructure projects, the communities around them, their metrics for success, funding, diversity efforts, and plans for sustainability. Yeah, we’re talking code, a lot of code, and code projects, too, from AstroPy to yt, and data, and efforts that support openness and research transparency.

Open data presentations were given on:

STScI data, which includes JWST, Hubble, and PanSTARRS data, and the discovery and analysis software for these archives, by Arfon Smith
SDSS Data Infrastructure, by Joel Brownstein
LSST Transients data, by Federica Bianco
Open gravitational wave data and software tools for these data, by Duncan Brown

These software projects were represented at the meeting:

photograph of the Kavli Institute for Theoretical PhysicsAstropy, by Kelle Cruz
ATHENA++, by Jim Stone
Einstein Toolkit, by Philipp Mösta
emcee, by Daniel Foreman-Mackey
GYRE, by Rich Townsend
JETFIT, by Andrew Macfadyen
MESA Project, by Frank Timmes
TOM Toolkit and the AEON Network, by Rachel Street
yt, by Matt Turk

Other open digital resources presented were:

Journal of Open Source Software, by Arfon Smith
R astrostatistics, by Gwendolyn Eadie
Astrophysics Source Code Library, by yours truly

The meeting hashtag was #OpenAstroInfra, and many of the presentations were live tweeted. They were also video recorded and the podcasts are available on the KITP media page for the meeting, as are most of the slide decks. Participants of the co-located “Better Stars, Better Planets: Exploiting the Stellar-Exoplanetary Synergy” and “The New Era of Gravitational-Wave Physics and Astrophysics” programs were encouraged to attend, and we had a raven or two trying to have lunch with us as well.

Each of the presentations had about 15 minutes devoted to questions and discussion about the project highlighted. In two of these discussion sessions, the presenters were asked whether they were concerned about “improper use” of a code; sometimes people who are not well-schooled in the theory or science underlying a software package will use the code incorrectly, arriving at results that are dodgy, or downright wrong, and in a few cases (I know of only one), have then claimed the software is in error. This fear has been given as the reason some software authors do not release their code. I was cheering in my head with Jim Stone’s response to this question the first time it came up; he stated that there is so much benefit to making the code available that a potential improper use should not stop release. (YES!!!) He further went on to say, as did others in the room, that science will correct the record (YES!!!!!). I could not agree more with these replies, and it was great to hear these sentiments from others.

This was my first visit to KITP, and what a wonderful introduction to the institution it was! So many excellent projects, and so much exciting work being done in the open! My thanks to organizers Frank Timmes, Lars Bildsten, and Rich Townsend for inviting the ASCL to participate, and to the Sloan and Ford Foundations for funding the meeting.

ASCL presentation slides

May 2019 additions to the ASCL

Twenty-seven codes were added to the ASCL in May, 2019:

Astrocut: Tools for creating cutouts of TESS images
Bandmerge: Merge data from different wavebands
beamModelTester: Model evaluation for fixed antenna phased array radio telescopes
Binospec: Data reduction pipeline for the Binospec imaging spectrograph
CASI-2D: Convolutional Approach to Shell Identification – 2D

ClusterPyXT: Galaxy cluster pipeline for X-ray temperature maps
evolstate: Assign simple evolutionary states to stars
FastPM: Scaling N-body Particle Mesh solver
Fermitools: Fermi Science Tools
Fitsverify: FITS file format-verification tool

Grizli: Grism redshift and line analysis software
HAOS-DIPER: HAO Spectral Diagnostic Package For Emitted Radiation
LensCNN: Gravitational lens detector
LensQuEst: CMB Lensing QUadratic Estimator
MMIRS-DRP: MMIRS Data Reduction Pipeline

NAPLES: Numerical Analysis of PLanetary EncounterS
ODEPACK: Ordinary differential equation solver library
PICASO: Planetary Intensity Code for Atmospheric Scattering Observations
Prospector: Stellar Population inference from spectra and SEDs
Py4CAtS: PYthon for Computational ATmospheric Spectroscopy

PyPDR: Chemistry, thermal balance, and molecular excitation code
Q3C: A PostgreSQL package for spatial queries and cross-matches of large astronomical catalogs
rPICARD: Radboud PIpeline for the Calibration of high Angular Resolution Data
SEDPY: Modules for storing and operating on astronomical source spectral energy distribution
SICON: Stokes Inversion based on COnvolutional Neural networks

SPARK: K-band Multi Object Spectrograph data reduction
THALASSA: Orbit propagator for near-Earth and cislunar space

April 2019 additions to the ASCL

Thirty codes were added to the ASCL in April, 2019:

AutoBayes: Automatic design of customized analysis algorithms and programs
CausticFrog: 1D Lagrangian Simulation Package
CDAWeb: Coordinated Data Analysis Web
CGS: Collisionless Galactic Simulator
CLEAR: CANDELS Ly-alpha Emission at Reionization processing pipeline and library

covdisc: Disconnected covariance of 2-point functions in large-scale structure of the Universe
deproject: Deprojection of two-dimensional annular X-ray spectra
dfitspy: A dfits/fitsort implementation in Python
digest2: NEO binary classifier
ehtim: Imaging, analysis, and simulation software for radio interferometry

EightBitTransit: Calculate light curves from pixel grids
eleanor: Extracted and systematics-corrected light curves for TESS-observed stars
FortesFit: Flexible spectral energy distribution modelling with a Bayesian backbone
GALAXY: N-body simulation software for isolated, collisionless stellar systems
JVarStar: Variable Star Analysis Library

nbodykit: Massively parallel, large-scale structure toolkit
nudec_BSM: Neutrino Decoupling Beyond the Standard Model
OoT: Out-of-Transit Light Curve Generator
Properimage: Image coaddition and subtraction
pyRSD: Accurate predictions for the clustering of galaxies in redshift-space in Python

rate: Reliable Analytic Thermochemical Equilibrium
repack: Repack and compress line-transition data
SARAH: SUSY and non-SUSY model builder and analyzer
SBGAT: Small Bodies Geophysical Analysis Tool
simuTrans: Gravity-darkened exoplanet transit simulator

SMILI: Sparse Modeling Imaging Library for Interferometry
Specstack: A simple spectral stacking tool
sxrbg: ROSAT X-Ray Background Tool
TP2VIS: Total Power Map to Visibilities
Vevacious: Global minima of one-loop effective potentials generator

Prize established for astronomy research software at UMD

This week, we saw another example of the importance of software in astronomy research. We are pleased to announce the establishment of an annual software prize from the UMD Astronomy Department for research software written by a registered undergraduate or graduate student while the student was at University of Maryland, College Park. The source code of the software must be open and assigned an acceptable open source license.

The prize consists of a certificate and a cash award, which will be presented at the Astronomy Department annual award ceremony. We reserve the right to withhold the prize if the criteria are not sufficiently met.

More details will be available later; if you would like additional information, please contact us.

Peter Teuben, teuben@astro.umd.edu
Alice Allen, aallen@ascl.net

 

February and March 2019 additions to the ASCL

Twelve codes were added to the ASCL in February, 2019:

dyPolyChord: Super fast dynamic nested sampling with PolyChord
ExPRES: Exoplanetary and Planetary Radio Emissions Simulator
GraviDy: Gravitational Dynamics
LiveData: Data reduction pipeline

LPNN: Limited Post-Newtonian N-body code for collisionless self-gravitating systems
PINT: High-precision pulsar timing analysis package
PyMF: Matched filtering techniques for astronomical images
Radynversion: Solar atmospheric properties during a solar flare

RPFITS: Routines for reading and writing RPFITS files
SNTD: Supernova Time Delays
Specutils: Spectroscopic analysis and reduction
SpecViz: 1D Spectral Visualization Tool

And sixteen codes were added to the ASCL in March, 2019:

allesfitter: Flexible star and exoplanet inference from photometry and radial velocity
AsPy: Aspherical fluctuations on the spherical collapse background
brutifus: A Python module to post-process datacubes from integral field spectrographs
DAVE: Discovery And Vetting of K2 Exoplanets

GalIMF: Galaxy-wide Initial Mass Function
Galmag: Computing realistic galactic magnetic fields
HelioPy: Heliospheric and planetary physics library
ICSF: Intensity Conserving Spectral Fitting

NFWdist: Density, distribution function, quantile function and random generation for the 3D NFW profile
NIFTy5: Numerical Information Field Theory v5
PLATON: PLanetary Atmospheric Transmission for Observer Noobs
PRF: Probabilistic Random Forest

SimSpin: Kinematic analysis of galaxy simulations
SIXTE: Simulation of X-ray Telescopes
SPICE: Observation Geometry System for Space Science Missions
SpiceyPy: Python wrapper for the NAIF C SPICE Toolkit

Research Data Alliance Plenary 13 presentation

The ASCL is participating in the Research Data Alliance (RDA) meeting currently underway in Philadelphia, PA. The Plenary 13 meeting motto is “With Data Comes Responsibility.” Indeed! Among the sessions of special interest for software folks was yesterday’s Interest Group meeting on Software Source Code and today’s first meeting of a new Working Group on Software Source Code Identification. The Working Group is led by Roberto Di Cosmo, who is a founder of Software Heritage, Martin Fenner from DataCite, and Daniel Katz from the University of Illinois. This initial meeting is titled “Identifying, referencing and citing the source code of research software: a state of the art.” The ASCL is doing a short presentation that focuses on a few of our practices, how we do them, and the rationale for them; this includes what we do when we process a submission, what metadata for software we do and don’t have and why, and some of our curation practices. Our slides for this presentation are available below.

Photograph of Alice presenting a slide

Photo courtesy of @StephvandeSandt

Our attendance at this meeting was made possible with support from Software Heritage; our thanks to that organization!

Slides (PDF)

The ADASS Time Domain Astronomy Hackathon, part 2

This is a continuation of a previous post, and was written by Brian Thomas, Alice Allen, Marc W. Pound, and Peter Teuben.


Lessons Learned
As this was the first such event of this type for ADASS we were unsure of the outcome, as it was somewhat of an experiment. We share some lessons learned for future events.

  1. Provide a list of interesting problems and related clean data. Doing so helps to bootstrap project ideas, as not all participants will have enough domain background to start quickly.  Because the event was so short, it was helpful to provide microservices and point to  datasets that were more or less cleaned and ‘ready to go’ for projects directed at these problem areas.
  2. Develop a marketing plan. We could have done a better job to garner interest in the event. We posted to a community BBS, a UMD subreddit, posted paper flyers in campus science and engineering buildings, and contacted student groups and faculty to help spread the word. However, we did not have a coordinated campaign that included social media and messaging targeted for specific dates and groups (e.g., “Save The Date” emails), nor was the hackathon mentioned in the ADASS registration form. A competing, large, all-women hackathon (https://gotechnica.org/) held the same weekend on campus also affected our enrollment.
  3. Venue (location and time) is important. The university was a good choice because of easy access to rooms, wifi, and food choices. Holding the hackathon at a large academic institution ensured that it would be easy for younger participants (undergrads) to attend, as did holding the event over a weekend to avoid conflicting with classes.
  4. Have an assessment tool/strategy. An exit survey or ending discussion with participants can help improve subsequent hackathons. We failed to take advantage of the opportunity to engage either the participants or the ADASS audience at the session where winning projects were presented about perceived problems and good aspects of our event.
  5. Narrow the range of participant experience. Future organizers should consider either limiting participation to non-professionals, or group the participants and awards into professional and non-professionals. It is somewhat unfair to have less experienced coders compete against domain specialists and possibly contrary to the avowed desire to use this event to advertise our field of work to outsiders.
  6. Time management is crucial. Scheduling a conference event right at the end of the hackathon was problematic, and not tightly managing the final presentation time and similar issues became important and detracted from the event. This will be particularly important in other events that have larger participation.

Conclusions
A community lives and dies by how well it nurtures the next generation. Folks enter the ADASS community by a number of means but typically by being either scientists who become attracted to the technical challenges of writing the software or as computer engineers and programmers who find the science use cases particularly interesting. We are not aware of any organized means to train the next generation of ADASS workers; there are no formal degree programs in “Astronomy Software.” As such, our community has taken a somewhat laissez-faire approach to training the next generation and this may lead to a future deficit in skilled professionals willing to work in our field. More and more our community’s skills are being found useful in application elsewhere; for example, many ADASS attendees can easily become highly sought after Data Scientists.

Hackathons are a step towards being more proactive in our outreach and provide an ideal means to encourage and interest a younger group of programmers in the complex and interesting challenges that our community tackles. We found a number of lessons in hosting this event but no showstoppers, and a good deal of goodwill was generated. Based on our experience, we heartily recommend that future ADASS events include hackathon events.

Acknowledgments. We would like to thank the City of College Park for providing the prize money, Vigilante Coffee for supplying much needed coffee, ASCL for providing snacks and the University of Maryland Astronomy Department for hosting the hackathon.

The ADASS Time Domain Astronomy Hackathon, part 1

This post was written by Brian Thomas, Alice Allen, Marc W. Pound, and Peter Teuben, and, with part 2, will appear in the ADASS XXVIII proceedings.
Brian is with the Office of Chief Information Officer, NASA HQ, Washington DC; Marc, Peter, and Alice are in the Astronomy Department at the University of Maryland in College Park, MD.


In this post, we describe the ADASS XXVIII hackathon, the first associated with an ADASS conference, and provide our motivation and the details of the event. A subsequent post discusses the lessons we learned from holding this event and our conclusions about it.

Introduction
A hackathon seeks to draw together a large group of folks for an intense and extended period  of creative programming. Hackathons may be held for a variety of purposes including, but not limited to, teaching (Huppenkothen et al. 2018), to draw together a technical community as a social event (Kellogg et al. 2019), and to draw attention to solving particular challenges or themes (as found, for example, on popular sites such as Kaggle). Pa Pa Pe Than et al. (2018) provides a broader overview of hackathon applications and uses.

Our motivation for holding a hackathon associated with the ADASS XXVIII meeting was aligned with outreach to interested individuals; we wanted to highlight topical technical problems that the ADASS community might be concerned with and introduce a new generation of rising computer programmers and scientists to the excitement of solving them. We chose the topic area of Time Domain Astronomy (TDA) to focus on for this event as it was also one of the themes for this year’s ADASS meeting and aligned well with the interests of the Department hosting the hackathon. We allowed a loose definition of TDA, dealing with any astronomical data where time was a parameter. Thus projects for this hackathon could involve, for example, variable stars, exoplanets, and bodies in the solar system.

Event Organization
The ADASS hackathon took place the weekend before the ADASS starting on Saturday morning and ending at noon on Sunday with the total event time being 27 hours. We provided a space in the University of Maryland Physical Sciences Complex (PSC) as well as snacks and coffee. The participants were required to attend the introduction and be present for final presentations at 11am on Sunday. Otherwise, they could stay in the PSC building or leave as they desired. A cash award (provided by the City of College Park) was available for the top 3 teams with $500, $350 and $150 being awarded to the first, second and third place teams respectively. The winning team was also provided time to present their hack during the ADASS meeting.

We began by having the participants introduce themselves, their backgrounds and interests. We then introduced the participants to the field of TDA, providing some general background and challenges in this area. Presentations were given by Charlotte Ward (UMD graduate student), Gerbs Bauer (UMD Research Professor), and Brian Thomas (NASA). We highlighted some datasets which could be applied to solving aspects of the challenges. This was followed by a freely flowing brainstorming session where people could discuss ideas and questions, and potential hacks could be focused. Ideas were placed on sticky notes on a wall. Participants were then allowed a short period of time to form teams and brainstorm. After another hour or so, each team presented an outline of their hack, potentially allowing members to join another team if skill sets were better suited elsewhere. In our case nobody decided to join another team.

We allowed for a range of project types. Projects could be new analyses or approaches or novel ways of understanding existing solutions or problems. The final product could be a proof-of-concept app, a plugin to existing code, a storyboard design, or really anything that embodies creative hacking around the TDA theme. We did not require that the final project be polished; a good idea that was well fleshed out could also be submitted. A final presentation of a few slides describing the work including the motivation and approach was the only requirement for consideration for a prize.

We used Devpost to help structure the hackathon. This site served as a centralized location from which information could be disseminated including rules of conduct and a discussion board which we used to distribute ideas and answer participant questions. Hackathon rules can be summarized as follows:

  • Each participant belongs to one team and one final submission, but is allowed to switch teams. Team makeup is not final until the presentation. The maximum team size was 5.
  • Only 1 submission per team.
  • A Code of Conduct. We did not tolerate harassment of hackathon participants in any form, including, but not limited to, harassment based on gender identity and expression, age, sexual orientation, disability, physical appearance, body size, race, ethnicity, nationality, religion, political views, previous hackathon attendance, lack of computing experience, or chosen programming language or tech stack. Sexual language and imagery was not appropriate at any point in the hackathon including in software hacks, social media, talks, presentations, or demos.

Hackathon participants violating any of these rules could be sanctioned or expelled
from the hackathon at the discretion of the hackathon organizers.

Participants
Our event was set up as a community hackathon and attracted students, professional hackathonners, and ADASS participants who formed teams (see below). Members of the hosting department and the ADASS program organizing committee served as judges. Out of the 34 original registrations, 6 were present but not playing (being part of the organization or just cheerleading), and 9 did not show up.

Judges, Organizers, and Teams
The session was organized by Peter Teuben, Brian Thomas, Alice Allen, Marc Pound, and Elizabeth Warner. Our judges were Alice Allen, Gerbs Bauer, Andy Harris, Nuria Lorente, Ada Nebot, and Brian Thomas. The 7 teams that participated are listed in Table 1. We have also noted which teams won which prizes.

Team members Project name
Sarah Frail and Patrick Shan Morpheus – Near Earth Objects Visualization
Marco Lam Drag and drop ensemble (2nd prize)
Paul Ross McWhirter and Josh Veitch-Michaelis Auto periodogram selection using MC (3rd prize)
Timothy Henderson and Matt Graber Solar Activity Viewer
Thomas Boch, Matthieu Baumann, and Siddha Mavuram Music of Light curves (1st prize)
Kyle Kaplan, Sankalp Gilda, Hayden Hotham, Steve Gambino, and Abbie Petulante ML on ZTF pipeline
Kevin Cai, Kael Lenus, James Zhou, and Justin Otor Fixed and Variable Time Kepler Viewer in WWT

Table 1. Hackathon Teams

The winning team “The Music of Light Curves” made their hack, the sonification of variable stars from the Gaia catalogue, available on https://tboch.github.io/music-lightcurves-hack/. Their presentation to the ADASS audience during the TDA session on Wednesday met with resounding applause (and later a mention in the international press).


Continue to Part 2, Lessons Learned and Conclusions

A visit to NASA’s Goddard Space Flight Center

Photo showing slide of new journals friendly to astro computing articles started since 2012

At the podium

Although I was born in Washington, DC and have spent most of my life in its Maryland suburbs, yesterday was my first time on the Goddard campus (aside from its Visitor Center, which I’ve been to many times), this despite having two family members and many friends who used to1 or do work there. I was excited! And I had a great reason for going: I was presenting a talk to the Astrophysics Science Division titled “Make your research software famous! (or at least discoverable).” The talk, broadcast on a NASA UStream channel and recorded for future viewing,2 covered a bit about our research on source code availability in astronomy, and also covered our current project to make NASA astro research software more discoverable, what the Astrophysics Source Code Library is and how it improves research transparency, software citation, and recent changes in publishing with regard to software that, combined with other changes in the community and science in general, make it easier than ever before to make one’s astro research software discoverable. The slides I presented are available for download (PDF), and links to different resources, journals, and organizations that I mentioned in the talk are also available.

Kristin Rutkowski, along with Tess Jaffe and Alex Reustle, hosted my visit to GSFC; I had met both Kristin and Tess at last year’s ADASS conference in College Park, where we had our first conversation about my visiting the site to talk about the ASCL. Yesterday’s audience was great; they were involved and asked a lot of excellent questions, about copyright, code authors not receiving credit for the software they write, how we handle dead links, mutable author lists, NASA policies regarding software release, and how the ASCL is funded. Some of the questions came from people attending remotely and were asked online; Alex made sure these were covered, too. Alex is also involved in making the video of the talk available online, and when it is available, I’ll update this post with its link.

Photo of NASA's Space Environment Simulator

Space Environment Simulator

After my presentation, Kristin and Tess took me to see some of the NASA labs and equipment, including the Space Environment Simulator Facility, the JWST/OTIS Vibration Test System, the currently out-of-service High Capacity Centrifuge, and the Acoustic Test Cell. We went through doors marked “Authorized Personnel Only”!! This is one of the perqs of working on the ASCL — I become “Authorized Personnel” when visiting telescopes and labs and such, which, to me, is very cool and exciting! Sure, it’s only for a few minutes and always in the company of others who have far more business being there than I do, but still: very cool and exciting!! After looking at these labs and equipment, Kristin and I said goodbye to Tess, and then drove over to see dinosaur footprints that had been found on the Goddard campus. (Could a visit anywhere be any cooler?!?!)

Dinosaur and small mammal tracks

Science science everywhere! I had a great time at Goddard, and thank Alex and Tess and especially Kristin for hosting my visit!

 

 

 

 

 

 

 

1 Happy retirement day, Janie!
2 No, that’s not nerve-wracking at all, so long as one doesn’t think about it.

Resources mentioned in NASA GSFC presentation on making research software more discoverable

Presentation slides (PDF)

Journals

Journal of Open Source Software (JORS)

Astronomy and Computing (A&C)

Computational Astrophysics and Cosmology (ComAC)

SoftwareX

Journal of Open Source Software (JOSS)

Research Notes of the AAS

Change leaders and guidelines

Force11/Force11 Software Citation Principles

CodeMeta

Working toward Sustainable Software for Science: Practice and Experiences (WSSSPE)

FAIR principles

Social coding sites and archival services

Bitbucket

GitHub

Figshare

Zenodo

Other resources

Asclepias

arXiv/arXiv Next Generation

DataCite