i am a senior research scientist at microsoft research in new york city, where my work in the area of computational social science involves applications of statistics and machine learning to large-scale social data. i was previously a member of the social dynamics group at yahoo! research. i received my ph.d. from columbia university's physics department where i am an adjunct professor in the applied math department. please see my resume for more project and background information.
this site serves several purposes, from presenting and organizing my current research and teaching efforts to publishing code and tips that i hope others will find useful.
i bookmark lots of references on delicious, occasionally tweet things, post random tidbits on tumblr, and share photos on flickr.
my latest geek tips, also available on twitter, tumblr, or as plain text:
20.09.22.16. rstats: use group_by_(.dots=col_names) to group by a vector of strings containing column names w/ dplyr http://bit.ly/2cp07Q0
20.09.14.16. rstats: use coord_cartesian(ylim=...) instead of scale_y_continuous(lim=...) with geom_smooth to avoid throwing out data http://bit.ly/2cbDT8L
20.09.14.16. rstats: use the directlabels package to easily label things on plots http://bit.ly/2cyyF2J
20.09.06.16. shell: your local weather report, ascii style with curl wttr.in/newyork
20.09.06.16. rstats: use labeller = label_both to show facet variable names along with levels http://bit.ly/2bQVKg3