Eight Years and the 8th Most Cited Paper from Galaxy Zoo

At Galaxy Zoo we’re really proud of our publication record – 48 papers and counting, just from the team using your classifications. In academic research one of the most important numbers a published paper has is the number which counts how many citations that paper has – simply a count of the number of other academic publications mention your work.

And we’re not only proud of the Galaxy Zoo publication record, but the citation record is becoming impressive too (if we do say so ourselves). For this post in the lead up to the 8th anniversary of the launch of Galaxy Zoo, here are the 8 most cited of our papers:

1. Lintott et al. 2008: “Galaxy Zoo: morphologies derived from visual inspection of galaxies from the Sloan Digital Sky Survey “(with 279 citations)

2. Bamford et al. 2009: “Galaxy Zoo: the dependence of morphology and colour on environment” (219 citations)

3. Lintott et al. 2011: “Galaxy Zoo 1: data release of morphological classifications for nearly 900 000 galaxies” (152 citations)

4. Skibba et al. 2009: “Galaxy Zoo: disentangling the environmental dependence of morphology and colour” (114 citations)

5. Schawinski et al. 2010: “Galaxy Zoo: The Fundamentally Different Co-Evolution of Supermassive Black Holes and Their Early- and Late-Type Host Galaxies” (102 citations)

6. Cardamone et al. 2009: “Galaxy Zoo Green Peas: discovery of a class of compact extremely star-forming galaxies” (101 citations)

7. Darg et al 2010: “Galaxy Zoo: the properties of merging galaxies in the nearby Universe – local environments, colours, masses, star formation rates and AGN activity” (92 citations)

8. Masters et al. 2010: “Galaxy Zoo: passive red spirals” (86 citations)

I’m personally especially proud of paper number 8 on that list, because it is one of the first papers I led making use of Galaxy Zoo classifications (and one of my most cited first author papers in fact). In that paper we explored the properties of the unusually passive (ie. not star forming) red spirals that had been noted in both Bamford et al. 2009 and Skibba et al. 2009. For astronomers this is one of the more well known discoveries from Galaxy Zoo, and these passive red spirals continue to be studied for what they can reveal about the modes of evolution of galaxies in our Universe, and that many spirals must stop forming stars before they lose their spiral structure.

blue_red_sp_ell

A red elliptical and blue spiral (top), with a blue elliptical and red spiral (lower).

(By the way for academics who might be interested the h-index of Galaxy Zoo is 24).

Eight Years and the 8 Most Talked-About Galaxies in Galaxy Zoo

Continuing the countdown to Galaxy Zoo’s 8th birthday, below are 8 of the most-commented-on galaxies in the active Galaxy Zoo. They range near (in astronomical terms) and far, from gorgeous disks to space-warping groups, and some of them aren’t even galaxies at all!

8. Galaxies Interacting (Arp 112)

image of Arp 112

#merger #arc #g-pair #bulge #tidaltails #ugc #wow #agn #ngc #ngc7806 #arp #markarian #dustlane #available_in_dr7 #spiral #gpair #awesome #tidal #lens #no_lens

A lovely example of the diversity of structures in the Universe. The central galaxy may have been a perfectly symmetric spiral before it was seriously disturbed by the elliptical galaxy on the left side of the shot, and what’s that wispy thing off to the right? Is it a former part of the central galaxy? And what is this all going to look like in a few billion years? Whatever happens, the volunteers made it clear this is a special one to classify and to look at.

7. A Beautiful Lens

Gravitational lens in CANDELS

#lens #lensing #knownlens #arc #lense #interesting

This gorgeous gravitational lens was spotted almost immediately upon the launch of the new Galaxy Zoo within the high-redshift CANDELS data. It generated multiple lively discussions and scientists and volunteers alike weighed in with further information. It turned out in this case that this was one of very few lenses that were already known, but there are likely still unknown lenses buried in the data, waiting to be discovered!

6. Emerald in the sky

image of this mysterious green-gem object

#quasar #edge #lbg #star

Initially identified as a high-redshift star-forming galaxy by one of our seasoned volunteers, a number of people subsequently looked further into the existing scientific literature. There was a lot of debate about this particular point of light, but in the end the volunteers uncovered a later paper confirming that this green gem (which would actually be either very red or nearly invisible to the human eye, as it’s “green” because it only shows up in the infrared filters used for this image) is actually just a star in our galaxy. Bummer, maybe, but this process is also an important part of science.

5. Lord of the Polar Ring

image of polar ring galaxy

#dustlane #polar #polarring #beautiful #polar-ring #elliptical #ring #edgeon #mothership #dust #polaring #question

This spectacular example of a polar ring galaxy couldn’t have been found in the original Galaxy Zoo or Galaxy Zoo 2, because it only made it into the Sloan Digital Sky Survey when the sky coverage was extended.

4. Collisional Ring (Arp 148)

Mayall's Object in the Sloan Digital Sky Survey

#merger #arp148 #arp #available_in_dr7 #lookalike #alphabet #ring

It takes a special kind of galaxy crash to make a collisional ring, and you can see this one in progress. It reminded our volunteers and scientists of the Cartwheel galaxy, another spectacular example of these snapshots of a brief moment in time.

3. Triangular Galaxy!

A disturbed spiral galaxy.

#merger #odd #dark #needle #holycow #wow #doublenucleus #tidaldebris #disturbed #rocket #cluster #irregularshape #spaceship #rocketship

Well, this is odd. This galaxy looks like it’s on its own, but it has a rather unusual shape that would usually imply some sort of interaction or collision. Our volunteers discussed what could be causing it – until they viewed a zoomed-out image and it became clear that this galaxy has recently flown by a trio of galaxies, which would be more than enough to disrupt it into this lovely shape.

triangle galaxy and a nearby group of galaxies

2. Hubble Resolves the Distant Universe

edge-on disk galaxy with dust lane

#spiral #overlap #dustlane #starburst #edge-on #edgeon

When a new batch of data taken by the Hubble Space Telescope appeared on the latest Galaxy Zoo, this was one of the first stunners remarked on by several people. Some of the parts of the sky covered by Hubble coincide with the Sloan Digital Sky Survey, and we linked the surveys up via Talk. Our tireless volunteers launched a thread collecting side-by-side images from SDSS and Hubble, showcasing the power of the world’s greatest space telescope. Hubble’s primary mirror is about the same size as that used by the SDSS, so the differences between the images of the same galaxy are due to the blurring effect of the atmosphere.

SDSS (on Earth) at left, HST (in space) at right.

SDSS (on Earth) at left, HST (in space) at right.

And, the most talked about image in the latest Galaxy Zoo is…

1. ……..really? this?

A barely-there galaxy.

It’s always the galaxies you least expect.

Okay, okay… If you saw this and said it looks like there isn’t a lot to talk about here, I wouldn’t blame you. And, indeed, there’s only one “short” comment from one of our volunteers, who used our Examine tools and discovered that this little blotch appears to be a very high-redshift galaxy.

However, that same volunteer also started a discussion with the question: just for fun, what’s the highest redshift you’ve found? Others responded, and thus began a quest to find the galaxy in Galaxy Zoo that is the farthest distance from us. This discussion is Galaxy Zoo at its finest, with new and experienced volunteers using the project as inspiration for their own investigations, scouring the scientific literature, and learning about the very early Universe.

It seems like the most likely known candidate so far is a quasar at a redshift of about 5.5 (at which point the Universe was about 1 billion years old), or, if you don’t think a quasar counts, an extended galaxy at = 4 or so (1.5 billion years old). But there’s just so much science wonderfulness here, all of it from our fantastic volunteers, and it all started with a patchy blob and a sense of curiosity.

Galaxy Zoo started with a million blobs (ish) and a sense of adventure. I think that’s fitting.

Eight years, eight Hubble Voorwerpje targets

It’s a week until the 8th anniversary of the launch of Galaxy Zoo.

The Hubble Space Telescope observations of giant ionized Voorwerpje clouds near galaxies with active nuclei, many found for the first time though the effort of Galaxy Zoo participants gives us another 8 – one at the end of a long road of numbers. 16,000 galaxies with known or possible active nuclei, 200 highly-ranked cloud candidates based on input from 185 participants, 50 spectroscopic observations, 19 giant ionized clouds, among which we found 8 with evidence that the nucleus has faded dramatically (and then observed by one Hubble Space Telescope). (You wondered where the numeral 8 would come in by now… and there is another one hidden below.) The first batch of scientific results from analysis of these images was described here, and the NASA/ESA press release with beautiful visualizations of the multi-filter image data can be seen here. As a visual summary, here are the images, with starlight and emission from [O III] and H-alpha shown in roughly true visual color.

Hubble images of Voorwerpje clouds

This project was an outgrowth of the discovery of Hanny’s Voorwerp, which remains probably the signature discovery of Galaxy Zoo. In astronomy, one is a pet rock, ten is a statistically valid sample – so we wanted to know more about how common such clouds might be, and what they could tell us about quasars more generally. Zoo participants answered this challenge magnificently.

The scientific interest in these objects and their history remains intense, and observations continue. I’ve recently finished processing integral-field spectra from the 8-meter Gemini-North telescope, where we have spectra at every point in a small field of view near the nucleus, and just recently we learned that our proposal for spectra in a few key areas at the high resolution of the Hubble telescope has been approved for the coming year.

Even (or especially) for kinds of objects behind its original statistical goals, Galaxy Zoo has provided an amazing ride these last 8 years. Stay with us – and if you see weirdly colored clouds around galaxies, feel free to flag them in Talk!

A Galaxy Walks Into A Bar…

4 barred galaxies from Galaxy Zoo projects past

We are pleased to announce that a Galaxy Zoo project is one of the first projects built on the new Zooniverse! Several years ago we measured the lengths of galactic bars in relatively nearby galaxies in the Sloan Digital Sky Survey, and Ben Hoyle wrote an excellent paper presenting new an interesting results on how bars, which are a distinct feature caused by a change in the nature of the orbits of some of the stars in a galaxy, relate to other physical properties of the galaxy, such as color (indicative of recent star formation) and the nature of spiral arms or rings. That work showed the power of measurements like these, which are not always easy for computers to get right.

Today, we’re hoping you’ll help us extend that set of detailed galaxy measurements into the distant Universe, with measurements of bars in about 8,000 galaxies from our previous projects using Hubble Space Telescope data, including the AEGIS, CANDELS, COSMOS, GEMS and GOODS surveys.

We’ve deliberately been pretty broad in our selection of galaxies which may have a bar, so the first thing the project asks you is to confirm whether you think the galaxy does indeed have one. There are many examples of barred and not-barred galaxies (including examples of sort-of-looks-like-barred-but-actually-isn’t-and-here’s-why) included in the project, and you can access them anytime by clicking the “Need some help?” button.

Galaxy Zoo Bar Lengths classification interface screen shot

We’ve also zoomed in on the central galaxy to make it easier to classify.

If the galaxy doesn’t have a bar, then you can move on to the next one. If it does, there are some follow-up questions about spiral arms and rings, and then we ask you to draw 2 lines on the image: one for the bar width and one for its length.

You can also join in the discussions after the classifications with our new Talk discussion tool, which is completely separate from the main Galaxy Zoo Talk (just like the rest of the project).

On a more personal note, this is a big step forward for the Zooniverse as a whole. The first draft version of this project came together in under 1 hour back in April. Afterward, we shared project links between science team members and iterated back and forth on the right questions to ask and the right data to use. This process would normally take at least 6 months and require a lot of one-on-one time with a Zooniverse developer. Instead, because the Zooniverse development team has done a brilliant job creating a Project Builder that’s flexible, powerful and also easy to use, we were able to create a new project in a way that’s analogous to, well, creating a blog.

In these early days of the new site’s release I’m sure there will be some bugs that need zapping, but even so the new capabilities of the Zooniverse are phenomenal. I suspect this is just the first of many new projects to be spun up in the New Zooniverse. (In fact, there are 3 more projects debuting alongside ours.)

Try it out here: Galaxy Zoo: Bar Lengths


bars written in galaxies

Now back in Technicolor!

The science team and I want to thank to everyone who’s helped participate in the last month of classifications for the single-band Sloan Digital Sky Survey images in Galaxy Zoo, which were finished last night! The data will help us answer one of our key science questions (how does morphology change as a function of observed wavelength?), helping explore the role played by dust, stellar populations of different ages, and active regions of star formation. Researchers, particularly those at the University of Portsmouth, are eager to start looking at your classifications immediately.

Not saturated enough to be Technicolor, strictly speaking.

“Look, Toto – Galaxy Zoo’s back in color!” (Image courtesy MGM/Ryan McCormick)

In the meantime, we’re returning to images that are likely more familiar to many volunteers: the SDSS gri color images from Data Release 8. These galaxies still need more data, especially for the disk/featured galaxies and detailed structures. However, we should have two new sets of data ready for classification very soon alongside the SDSS, including a brand-new telescope and something a little different than before.

Please let us know on Talk if you have any questions, particularly if you have feedback about the single-band images or the science we’re working on. Thanks again!!

Stellar Populations of Quiescent Barred Galaxies Paper Accepted!

A new paper using Galaxy Zoo 2 bar classification has recently been accepted!

In this paper (which can be found here: http://arxiv.org/abs/1505.02802), we use hundreds of SDSS spectra to study the types of stars, i.e., stellar populations, that make up barred and unbarred galaxies. The reason for this study is that simulations predict that bars should affect the stellar populations of their host galaxies. And while there have been numerous studies that have addressed this issue, there still is no consensus.

A graphic summary of this study is shown here:

stellarpop_bars_fig1_2015-02-16

In this study, we stack hundreds of quiescent, i.e., non-star-forming, barred and unbarred galaxies in bins of redshift and stellar mass to produce extremely high-quality spectra. The center-left panel shows our parent sample in grey, and the cyan and green hash marks represent our galaxy selection for our bulge and gradient analysis. The black rectangle represents one of the bins we use. The upper and lower plots show the resultant stacked spectra of the barred and unbarred galaxies, respectively. We show images of barred and unbarred galaxies in the center, selected with the Galaxy Zoo 2 classifications. Finally, the center-right panel shows the ratio of these two stacked spectra at several wavelengths that reflect certain stellar population parameters.

Our main result is shown here:

stellarpop_vs_mass_fig2_2015-02-16_alf

We plot several stellar population parameters as a function of stellar mass for barred and unbarred galaxies. Specifically, we plot the stellar age, which gives us an idea of the average age of a galaxy’s stars, stellar metallicity ([Fe/H]), which gives us an idea of the relative amount of elements heavier than hydrogen in a galaxy, alpha-abundance ([Mg/Fe]), which gives us an idea of the timescale it took to form a galaxy’s stars, and nitrogen abundance ([N/Fe]), which also gives us an idea of the timescale it took to form a galaxy’s stars.

The main result of our study is that there are no statistically significant differences in the stellar populations of quiescent barred and unbarred galaxies. Our results suggest that bars are not a strong influence on the chemical evolution of quiescent galaxies, which seems to be at odds with the predictions.

Finished with Hubble (for now), with new images going back to our “local” Universe

Thanks for everyone’s help on the recent push with the Hubble CANDELS and GOODS images. I’m happy to say that we’ve just completed the full set, and are working hard on analysis of how the new depths change the morphologies. In the meantime, we’re delighted to announce that we have even more new images on Galaxy Zoo!

The new set of images now active are slightly different for us, and so we wanted to explain here what they are and why we want to collect classifications for them.

In all phases of Galaxy Zoo so far we have shown you galaxy images which are in colour. The details of how these are created varies depending on which survey the images are from. With the SDSS images, we combine information from three of the five observational filters used by Sloan (g, r, i) to produce a single three-colour image for each galaxy. We’ve talked before in more detail about how those colour images are made. All five Sloan filters and their wavelengths and sensitivity are shown below. You can probably see why we’d pick gri for our standard colour images: these are the most sensitive filters, roughly in the “green”, “red” and “infrared” (or just about) parts of the spectrum.

SDSS Filters

The five SDSS filters and the wavelengths they span.

 

Each of the SDSS filters is designed to observe the galaxy at a different part of the visible (or near visible) spectrum, with the bluest filter (the u-band; just into the UV part of the spectrum) and the reddest the z-band (which is into the infra-red). Different types of stars dominate the light from galaxies in different parts of the spectrum, for example hot massive young stars are very bright in the u-band, while dimmer lower mass stars are redder. Galaxies with older populations of stars will therefore look redder, as the massive blue stars will all have gone supernova already.

We are interested in measuring how a galaxy’s classification differs when it’s observed in each of the filters individually. To investigate this specific question, we have put together a selection of SDSS galaxies and instead of showing you a single three-colour image for each, we are showing you separately the original single filter images. We want you to classify them just as normal, and we will use these classifications to quantify how the classification changes from the blue to the red images.

Example postage stamp images of the monochromatic single filter images.

Example postage stamp images of the monochromatic single filter images.

Astronomers have a good “rule of thumb” for what should happen to galaxy morphology as we move to redder (or bluer) filters, but it’s only ever been measured in very small samples of galaxies. With your help we’ll make a better measurement of this effect, which will be really useful in the interpretation of other trends we observe with galaxy colour.

(Hint: some users might want to use the “Invert” button on the Galaxy Zoo interface a little bit more for these images, as some galaxies are more clearly seen when you toggle it.)

Explore Galaxy Zoo Classifications

This post (and visualization) is by Coleman Krawczyk, a Zooniverse Data Scientist at the ICG at the University of Portsmouth

Today we’ve added another new tool for visualizing Galaxy Zoo, this time showing the full vote path of all users for each galaxy from GZ2 onward.  The first node of the visualization shows an image of the galaxy and each of the other nodes represents the answer to a question from the Galaxy Zoo decision tree, and the size of the node is proportional to the number of votes for that answer.  The maximal vote path is highlighted and also shown in words across to top of the tree, and the results of the “Is there anything odd?” question are shown across the bottom.
The full Galaxy Zoo catalog can be searched via Zooniverse ID (the same one used for Talk), RA and Dec, or randomly.  After picking a galaxy the nodes can be moved around by clicking and dragging, and the links can be collapsed/expanded by clicking the attached nodes, both of these functions are useful for untangling complex trees.  Various properties of the visualization can also be controlled with the sliders below the tree.  For a guided tour of this tool click the “Take a tour” button, and for a full list of features click the “Help” button.
Screenshot of the Visualisation Tool

Screenshot of the Visualisation Tool

Visualizing the decision trees for Galaxy Zoo

This post (and visualization) is by Coleman Krawczyk, a Zooniverse Data Scientist at the ICG at the University of Portsmouth

Today we’ve added a new tool that visualizes the full decision tree for every Galaxy Zoo project from GZ2 onward (GZ1 only asked users one question, and would make for a boring visualization).  Each tree shows all the possible paths Galaxy Zoo users can take when classifying a galaxy.  Each “task” is color-coded by the minimum number of branches in the tree a classifier needs to take in order to reach that question.  In other words, it indicates how deeply buried in the tree a particular question is, a property that is helpful when scientists are analyzing the classifications.

Galaxy Zoo has used two basic templates for its decision trees.  The first template allowed users to classify galaxies into smooth, edge-on disks, or face on disks (with bars and/or spiral arms) and was used for Galaxy Zoo 2, the infrared UKIDSS images, and is currently being used for the SDSS data that is live on the site. The second template was designed for high-redshift galaxies, and allows users to classify galaxies into smooth, clumpy, edge on disks, or face on disks. This template was used for Galaxy Zoo: Hubble (GZ3), FERENGI (artificially redshifted images of galaxies), and is currently being used by the CANDELS and GOODS images in GZ4.  Although these final three projects ask the same basic questions, there are some subtle differences between them in the questions we ask about the bulge dominance, “odd” features, mergers, spiral arms, and/or clumps.

Visualization of the decision tree for Galaxy Zoo 2 (GZ2), by C. Krawcyzk. Colors indicate the depth of a particular question within the decision tree.

Visualization of the decision tree for Galaxy Zoo 2 (GZ2), by C. Krawczyk. Colors indicate the depth of a particular question within the tree.

If you ever wanted to know all the questions Galaxy Zoo could possibly ask you, head on over to the new visualization and have a look!

New paper: Galaxy Zoo and machine learning

I’m really happy to announce a new paper based on Galaxy Zoo data has just been accepted for publication. This one is different than many of our previous works; it focuses on the science of machine learning, and how we’re improving the ability of computers to identify galaxy morphologies after being trained off the classifications you’ve provided in Galaxy Zoo. This paper was led by Sander Dieleman, a PhD student at Ghent University in Belgium.

This work was begun in early 2014, when we ran an online competition through the Kaggle data platform called “The Galaxy Challenge”. The premise was fairly simple – we used the classifications provided by citizen scientists for the Galaxy Zoo 2 project and challenged computer scientists to write an algorithm to match those classifications as closely as possible. We provided about 75,000 anonymized images + classifications as a training set for participants, and kept the same amount of data secret; solutions submitted by competitors were tested on this set. More than 300 teams participated, and we awarded prizes to the top three scores. You can see more details on the competition site.

Since completing the competition, Sander has been working on writing up his solution as an academic paper, which has just been accepted to Monthly Notices of the Royal Astronomical Society (MNRAS). The method he’s developed relies on a technique known as a neural network; these are sets of algorithms (or statistical models) in which the parameters being fit can change as they learn, and can model “non-linear” relationships between the inputs. The name and design of many neural networks are inspired by similarities to the way that neurons function in the brain.

One of the innovative techniques in Sander’s work has been to use a model that makes use of the symmetry in the galaxy images. Consider the pictures of the same galaxy below:

Screen Shot 2015-03-27 at 4.16.07 PM

A galaxy from GZ2, shown both with no rotation (left) and rotated by 45 degrees (right).

From the classifications in GZ, we’d expect the answers for these two images to be identical; it’s the same galaxy, after all, no matter which way we look at it. For a computer program, however, these images would need to be separately analyzed and classified. Sander’s work exploits this in two ways:

  1. The size of the training data can be dramatically increased by including multiple, rotated versions of the different images. More training data typically results in a better-performing algorithm.
  2. Since the morphological classification for the two galaxies should be the same, we can apply the same feature detectors to the rotated images and thus share parameters in the model. This makes the model more general and improves the overall performance.

Once all of the training data is in, Sander’s model takes images and can provide very precise classifications of morphology. I think one of the neatest visualizations is this one: galaxies along the top vs bottom rows are considered “most dis-similar” by the maps in the model. You can see that it’s doing well by, for example, grouping all the loose spiral galaxies together and predicting that these are a distinct class from edge-on spirals.

From Figure 13 in Dieleman et al. (2015). Example sets of images that are maximally distinct in the prediction model. The top row consists of loose winding spirals, while the bottom row are edge-on disks.

From Figure 13 in Dieleman et al. (2015). Example sets of images that are maximally distinct in the prediction model. The top row consists of loose winding spirals, while the bottom row are edge-on disks.

For more details on Sander’s work, he has an excellent blog post on his own site that goes into many of the details, a lot of which is accessible even to a non-expert.

While there are a lot of applications for these sorts of algorithms, we’re particularly interested in how this will help us select future datasets for Galaxy Zoo and similar projects. For future surveys like LSST, which will contain many millions of images, we want to efficiently select the images where citizen scientists can contribute the most – either for their unusualness or for the possibility of more serendipitous discoveries. Your data are what make innovations like this possible, and we’re looking forward to seeing how these can be applied to new scientific problems.

Paper: Dieleman, Willett, & Jambre (2015). “Rotation-invariant convolutional neural networks for galaxy morphology prediction”, MNRAS, accepted.