Astronomy 383L - Fall 2016

Cosmos Seminar

W 3:30 · RLM 15.316B · Not for credit in Fall 2016


The Cosmos seminar is used for scientific seminars, interdisciplinary talks, seminars on innovative teaching techniques, and discussions of research initiatives and strategic priorities. Please contact the Department Chair, Professor Shardha Jogee, to schedule a seminar by sending email to the chair-at-astro account: chair@astro

Schedule

Aug. 24 Shardha Jogee
University of Texas at Austin
Organizational Meeting

Aug. 31 Markus Kissler-Patig
Director, Gemini Observatory
Heidi Hammel
Executive Vice President, AURA
Gemini Observatory Community Event

Sep. 7 No talk scheduled

Sep. 14 Taft Armandroff
The University of Texas at Austin
Progress with McDonald Observatory Initiatives

Sep. 21 Anita Cochran
The University of Texas at Austin
** Starting at 3:00 p.m. this week only **
Update on GMT and Discussion of SAC Technical Document


Sep. 28 No talk scheduled

Oct. 5 Talk rescheduled to October 26
 

Oct. 12 Sean Wang
Director, Data Science at Fidelity Investments
It Doesn't Have to be Rocket Science - Non-academic careers for fun and profit

Oct. 19 Keely Finkelstein
The University of Texas at Austin
Teaching Tools, Tips, and Strategies: Low Stakes Testing, Immediate Feedback Assessment & More

Oct. 26 Niall Gaffney
Director for Data Intensive Computing, Texas Advanced Computing Center

Zhao Zhang
Research Associate, Texas Advanced Computing Center
Recent Developments at TACC



Processing Astronomy Imagery Using Big Data Technology

Scientific analyses commonly compose multiple single-process programs into a dataflow. An end-to-end dataflow of single-process programs is known as a many-task application. Typically, HPC tools are used to parallelize these analyses. In this work, we investigate an alternate approach that uses Apache Spark—a modern platform for data intensive computing—to parallelize many-task applications. Using Apache Spark, we implement Kira, a flexible and distributed astronomy image processing toolkit. We then use the Kira toolkit to implement the Kira SE application for extracting sources from astronomy images. Using Kira SE as a case study, we study the programming flexibility, dataflow richness, scheduling capacity and performance of Apache Spark running on the EC2 cloud. By exploiting data locality, Kira SE achieves a 4.1× speedup over an equivalent C program when analyzing a 1TB dataset using 512 cores on the Amazon EC2 cloud. Furthermore, we show that by leveraging software originally designed for big data infrastructure, we are able to use the Amazon EC2 cloud to achieve a 1.8× speedup over the C implementation running on the NERSC Edison supercomputer, when holding core count constant. Using the same implementation of Kira SE, a 128-core EC2 cloud deployment that uses Spark Streaming can achieve second-scale latency with a sustained throughput of ∼600 MB/s. Our experience with Kira demonstrates that data intensive computing platforms like Apache Spark are a performant alternative for many-task scientific applications.

close


Nov 2 Scott Acton
Ball Aerospace
JWST: An Observatory Beyond the Moon

Nov. 9 On hold
On hold

Nov. 16 Anna Quider
Director of Federal Relations, Northern Illinois University
The Federal Budget: A Primer for Scientists

abstract


Nov. 23 Thanksgiving Holiday
 

Nov. 30 On hold
On hold