An Algorithm to Identify Every Tree

Tree branches over a street in Southern California. Credit: Shutterstock

Stop for a second and look out your window. You’ll likely see a tree somewhere outside. Do you know its species? How often should it be watered? Is it healthy?

Pietro Perona imagines a world where you could look at a tree—or myriad other objects—and, with the help of a smartphone, gain instant expertise about it. While you may not know anything about the item, “someone will know what it is,” says Perona, the Allen E. Puckett Professor of Electrical Engineering at the California Institute of Technology (Caltech). “And if someone knows, [your] iPhone should be able to know as well.”

He doesn’t mean that you’ll just ask Google or Siri. Rather, you’d take a photo of the object—take the tree, for example—and an algorithm used in an app or a website would be able to recognize it by comparing your image to data collected on a variety of tree species. Perona envisions this type of algorithm working for millions of other observable things, from plants and animals to geological formations—even various health-related symptoms, like skin rashes.

Perona researches computer vision, and how to “reproduce the ability of the human visual system in a machine,” he says. For the past few years, he’s been particularly interested in training computers to do fine-grained visual categorization, which is the ability to identify objects very specifically, like a tree down to its species, or a building by its style of architecture. Some companies already use technologies that can do this—Facebook’s software can recognize your friends’ visages in photographs, for example.

In 2009, Perona started exploring the idea of using machine learning to provide on-demand expertise about bird species. It was the first project in an ongoing collaboration with Cornell University computer science professor Serge Belongie called Visipedia, which is essentially a visual encyclopedia that learns from human experts how to classify objects. Their labs worked with bird experts to label thousands of internet images of birds from around the country, annotating distinguishing details about their beaks, eyes, and other morphology. They then fed this data into their algorithm, thereby “teaching” it how to identify birds through their observable traits.

Computer visual algorithms “have strong learning components, so you [can] build a very generic piece of software that can learn from example,” Perona says.

The resulting bird algorithm can so far identify more than 550 bird species in North America, Perona says, and was recently applied to a mobile app called Merlin, which enables users to upload or take a photo of a bird to find out its species.

For his next Visipedia project, Perona is applying the algorithm to trees, having been inspired by the neighborhood conditions around Caltech. “I was somewhat concerned about the future of trees in Pasadena because California was restricting watering, and I could see that many trees were dying,” Perona says. He thought that the algorithm could provide a better street tree inventory, which public officials could draw on to improve their strategies for tending to their urban forests.

Fortunately, satellite imagery services like Google Maps provide a regularly updated and publicly available trove of images of street trees in urban areas like Pasadena. Perona recruited human volunteers to label the trees from non-tree objects in images taken around Pasadena, and then used species information from a previous tree inventory done by the city to train the algorithm’s identification skills.

In a proof-of-concept test comparing the algorithm’s output with the city survey of Pasadena trees, the algorithm correctly detected about 80 percent of the trees, and out of those, identified the correct species about 80 percent of the time, Perona says.

Perona foresees this algorithm being used primarily by city planners across the country. By precluding the need for human surveyors, the technology could potentially save tens of thousands of hours and possibly even millions of dollars, Perona estimates. Moreover, the algorithm could be used to conduct arboreal surveys more regularly, allowing for more updated information on the status of trees in a given area.

“Most cities will survey every 20-30 years, [but] potentially we could have a survey every six months,” Perona says. “We could see how the urban forests in the U.S. are evolving over time much more accurately, how they’re responding to climate change, how they’re responding to a virus that’s spread from one county to the next. It will be more like seeing the stock market or something that changes all the time, and it gets updated rather than [retaining] this very static point of view.”

Meet the Writer

About Chau Tu

@chaubtu

Chau Tu is an associate editor at Slate Plus. She was formerly Science Friday’s story producer/reporter.

Cookie	Duration	Description
_abck	1 year	This cookie is used to detect and defend when a client attempt to replay a cookie.This cookie manages the interaction with online bots and takes the appropriate actions.
ASP.NET_SessionId	session	Issued by Microsoft's ASP.NET Application, this cookie stores session data during a user's website visit.
AWSALBCORS	7 days	This cookie is managed by Amazon Web Services and is used for load balancing.
bm_sz	4 hours	This cookie is set by the provider Akamai Bot Manager. This cookie is used to manage the interaction with the online bots. It also helps in fraud preventions
cookielawinfo-checkbox-advertisement	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Advertisement" category .
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
csrftoken	past	This cookie is associated with Django web development platform for python. Used to help protect the website against Cross-Site Request Forgery attacks
JSESSIONID	session	The JSESSIONID cookie is used by New Relic to store a session identifier so that New Relic can monitor session counts for an application.
nlbi_972453	session	A load balancing cookie set to ensure requests by a client are sent to the same origin server.
PHPSESSID	session	This cookie is native to PHP applications. The cookie is used to store and identify a users' unique session ID for the purpose of managing user session on the website. The cookie is a session cookies and is deleted when all the browser windows are closed.
TiPMix	1 hour	The TiPMix cookie is set by Azure to determine which web server the users must be directed to.
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
visid_incap_972453	1 year	SiteLock sets this cookie to provide cloud-based website security services.
X-Mapping-fjhppofk	session	This cookie is used for load balancing purposes. The cookie does not store any personally identifiable data.
x-ms-routing-name	1 hour	Azure sets this cookie for routing production traffic by specifying the production slot.

Cookie	Duration	Description
__cf_bm	30 minutes	This cookie, set by Cloudflare, is used to support Cloudflare Bot Management.
bcookie	2 years	LinkedIn sets this cookie from LinkedIn share buttons and ad tags to recognize browser ID.
bscookie	2 years	LinkedIn sets this cookie to store performed actions on the website.
lang	session	LinkedIn sets this cookie to remember a user's language setting.
lidc	1 day	LinkedIn sets the lidc cookie to facilitate data center selection.
S	1 hour	Used by Yahoo to provide ads, content or analytics.
sp_landing	1 day	The sp_landing is set by Spotify to implement audio content from Spotify on the website and also registers information on user interaction related to the audio content.
sp_t	1 year	The sp_t cookie is set by Spotify to implement audio content from Spotify on the website and also registers information on user interaction related to the audio content.
UserMatchHistory	1 month	LinkedIn sets this cookie for LinkedIn Ads ID syncing.

Cookie	Duration	Description
__jid	30 minutes	Cookie used to remember the user's Disqus login credentials across websites that use Disqus.
_gat	1 minute	This cookie is installed by Google Universal Analytics to restrain request rate and thus limit the collection of data on high traffic sites.
_gat_UA-28243511-22	1 minute	A variation of the _gat cookie set by Google Analytics and Google Tag Manager to allow website owners to track visitor behaviour and measure site performance. The pattern element in the name contains the unique identity number of the account or website it relates to.
AWSALB	7 days	AWSALB is an application load balancer cookie set by Amazon Web Services to map the session to the target.
countryCode	session	This cookie is used for storing country code selected from country selector.

Cookie	Duration	Description
_fbp	3 months	This cookie is set by Facebook to display advertisements when either on Facebook or on a digital platform powered by Facebook advertising, after visiting the website.
fr	3 months	Facebook sets this cookie to show relevant advertisements to users by tracking user behaviour across the web, on sites that have Facebook pixel or Facebook social plugin.
IDE	1 year 24 days	Google DoubleClick IDE cookies are used to store information about how the user uses the website to present them with relevant ads and according to the user profile.
NID	6 months	NID cookie, set by Google, is used for advertising purposes; to limit the number of times the user sees an ad, to mute unwanted ads, and to measure the effectiveness of ads.
personalization_id	2 years	Twitter sets this cookie to integrate and share features for social media and also store information about how the user uses the website, for tracking and targeting.
test_cookie	15 minutes	The test_cookie is set by doubleclick.net and is used to determine if the user's browser supports cookies.
vglnk.Agent.p	1 year	VigLink sets this cookie to track the user behaviour and also limit the ads displayed, in order to ensure relevant advertising.
vglnk.PartnerRfsh.p	1 year	VigLink sets this cookie to show users relevant advertisements and also limit the number of adverts that are shown to them.
VISITOR_INFO1_LIVE	5 months 27 days	A cookie set by YouTube to measure bandwidth that determines whether the user gets the new or old player interface.
YSC	session	YSC cookie is set by Youtube and is used to track the views of embedded videos on Youtube pages.
yt-remote-connected-devices	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt-remote-device-id	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt.innertube::nextId	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.
yt.innertube::requests	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.

Cookie	Duration	Description
_dc_gtm_UA-28243511-20	1 minute	No description
abtest-identifier	1 year	No description
AnalyticsSyncHistory	1 month	No description
ARRAffinityCU	session	No description available.
ccc	1 month	No description
COMPASS	1 hour	No description
cookies.js_dtest	session	No description
debug	never	No description available.
donation-identifier	1 year	No description
f	never	No description available.
GFE_RTT	5 minutes	No description available.
incap_ses_1185_2233503	session	No description
incap_ses_1185_823975	session	No description
incap_ses_1185_972453	session	No description
incap_ses_1319_2233503	session	No description
incap_ses_1319_823975	session	No description
incap_ses_1319_972453	session	No description
incap_ses_1364_2233503	session	No description
incap_ses_1364_823975	session	No description
incap_ses_1364_972453	session	No description
incap_ses_1580_2233503	session	No description
incap_ses_1580_823975	session	No description
incap_ses_1580_972453	session	No description
incap_ses_198_2233503	session	No description
incap_ses_198_823975	session	No description
incap_ses_198_972453	session	No description
incap_ses_340_2233503	session	No description
incap_ses_340_823975	session	No description
incap_ses_340_972453	session	No description
incap_ses_374_2233503	session	No description
incap_ses_374_823975	session	No description
incap_ses_374_972453	session	No description
incap_ses_375_2233503	session	No description
incap_ses_375_823975	session	No description
incap_ses_375_972453	session	No description
incap_ses_455_2233503	session	No description
incap_ses_455_823975	session	No description
incap_ses_455_972453	session	No description
incap_ses_8076_2233503	session	No description
incap_ses_8076_823975	session	No description
incap_ses_8076_972453	session	No description
incap_ses_867_2233503	session	No description
incap_ses_867_823975	session	No description
incap_ses_867_972453	session	No description
incap_ses_9117_2233503	session	No description
incap_ses_9117_823975	session	No description
incap_ses_9117_972453	session	No description
li_gc	2 years	No description
loglevel	never	No description available.
msToken	10 days	No description

Meet the Writer

About Chau Tu

Explore More

Discussing the Impacts of Social Media Algorithms

Plant Emissions: How Do Trees Interact With Pollution?