{"id":1073,"date":"2014-03-28T08:53:55","date_gmt":"2014-03-28T12:53:55","guid":{"rendered":"http:\/\/www.craigperler.com\/blog\/?p=1073"},"modified":"2024-06-06T23:25:56","modified_gmt":"2024-06-07T03:25:56","slug":"can-yelp-reviews-predict-real-estate-prices","status":"publish","type":"post","link":"https:\/\/www.craigperler.com\/blog\/2014\/03\/28\/can-yelp-reviews-predict-real-estate-prices\/","title":{"rendered":"Can Yelp Reviews Predict Real Estate Prices?"},"content":{"rendered":"\n<p class=\"has-text-align-left\">I took a\u00a0<a href=\"http:\/\/people.stern.nyu.edu\/ja1517\/pdsfall2012\/\" target=\"_blank\" rel=\"noopener\">Data Science class<\/a>\u00a0in my <a href=\"https:\/\/www.craigperler.com\/blog\/category\/mba-notes\/\">MBA program<\/a>, and I was recently re-reading our final project write-up. \u00a0The assignment was to take a complex data set, do some analysis on it, and show the results.  My group decided to focus on NYC real estate. \u00a0We boldly set out to use subjectively important variables to predict real estate prices of NYC apartments.<\/p>\n\n\n\n<p class=\"has-text-align-left\">Our thesis at the outset was that deals could be found in unexpected places and neighborhoods that one may not have previously considered. 
\u00a0After we realized that the granularity of data required for this project wasn&#8217;t readily available, we pivoted to a more reasonable thought exercise:\u00a0might lagging annual or quarterly Yelp reviews predict real estate listing prices? That is, if local businesses saw an uptrend in reviews (more reviews or better reviews) in a given year or quarter, would we see an increase in the listing prices of local real estate, and if so, could we use this knowledge to forecast prices?<\/p>\n\n\n\n<h2 id=\"project-outcomes\" class=\"wp-block-heading\">Project Outcomes<\/h2>\n\n\n\n<p class=\"has-text-align-left\">The project entailed marrying data from Yelp, Trulia, Zillow, Google, and NYC Open Data, organizing it all, running various regression analyses on it, and then visualizing the results. &nbsp;We generated some cool charts. &nbsp;For example, this graph shows&nbsp;average prices for various property sizes from Trulia plotted against the average reviews from Yelp for pharmacies, restaurants, and bars.<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" width=\"839\" height=\"748\" src=\"https:\/\/i0.wp.com\/www.craigperler.com\/blog\/wp-content\/uploads\/2014\/03\/yelp_v_trulia.png?resize=839%2C748&#038;ssl=1\" alt=\"yelp_v_trulia\" class=\"wp-image-1075\" srcset=\"https:\/\/i0.wp.com\/www.craigperler.com\/blog\/wp-content\/uploads\/2014\/03\/yelp_v_trulia.png?w=839&amp;ssl=1 839w, https:\/\/i0.wp.com\/www.craigperler.com\/blog\/wp-content\/uploads\/2014\/03\/yelp_v_trulia.png?resize=300%2C267&amp;ssl=1 300w\" sizes=\"auto, (max-width: 839px) 100vw, 839px\" \/><\/figure>\n<\/div>\n\n\n<p class=\"has-text-align-left\">In Q2 of 2012, bar and pharmacy reviews improve noticeably.\u00a0 It appears that rankings improve after prices for larger property types increase, and though it\u2019s hard to tell, the uptrends in Yelp reviews appear to be followed by a slight increase 
in the price of 1-, 2- and 3-bedroom units.<\/p>\n\n\n\n<p class=\"has-text-align-left\">Ultimately, our conclusion was that the best predictor of next year&#8217;s prices is this year&#8217;s prices! \u00a0In other words, data science isn&#8217;t easy. \u00a0It was an interesting exercise if nothing else. \u00a0The full write-up is available <a href=\"https:\/\/docs.google.com\/document\/d\/1hhVEUQIEyf5iCutXl2kWnezEpjCe79y8_XyXiEWkxGA\/edit\" target=\"_blank\" rel=\"noopener\">here<\/a>, and all the code is up on <a href=\"https:\/\/github.com\/cperler\/PDS-Project\" target=\"_blank\" rel=\"noopener\">GitHub<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>I took a\u00a0Data Science class\u00a0in my MBA program, and I was recently re-reading our final project write-up. \u00a0The assignment was to take a complex data set, do some analysis on&hellip;<\/p>\n","protected":false},"author":1,"featured_media":1621,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[46],"tags":[],"powerkit_post_featured":[],"class_list":{"0":"post-1073","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-projects"},"jetpack_featured_media_url":"https:\/\/i0.wp.com\/www.craigperler.com\/blog\/wp-content\/uploads\/2014\/03\/pexels-photo.jpg?fit=1880%2C1125&ssl=1","jetpack_shortlink":"https:\/\/wp.me\/p1SwZ6-hj","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/www.craigperler.com\/blog\/wp-json\/wp\/v2\/posts\/1073","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.craigperler.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.craigperler.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.craigperler.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.craigperle
r.com\/blog\/wp-json\/wp\/v2\/comments?post=1073"}],"version-history":[{"count":5,"href":"https:\/\/www.craigperler.com\/blog\/wp-json\/wp\/v2\/posts\/1073\/revisions"}],"predecessor-version":[{"id":1623,"href":"https:\/\/www.craigperler.com\/blog\/wp-json\/wp\/v2\/posts\/1073\/revisions\/1623"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.craigperler.com\/blog\/wp-json\/wp\/v2\/media\/1621"}],"wp:attachment":[{"href":"https:\/\/www.craigperler.com\/blog\/wp-json\/wp\/v2\/media?parent=1073"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.craigperler.com\/blog\/wp-json\/wp\/v2\/categories?post=1073"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.craigperler.com\/blog\/wp-json\/wp\/v2\/tags?post=1073"},{"taxonomy":"powerkit_post_featured","embeddable":true,"href":"https:\/\/www.craigperler.com\/blog\/wp-json\/wp\/v2\/powerkit_post_featured?post=1073"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}