i am a senior research scientist at microsoft research in new york city, where my work in the area of computational social science involves applications of statistics and machine learning to large-scale social data. i was previously a member of the social dynamics group at yahoo! research. i received my ph.d. from columbia university's physics department where i am an adjunct professor in the applied math department. please see my resume for more project and background information.

this site serves several purposes, from presenting and organizing my current research and teaching efforts to publishing code and tips that i hope others will find useful.

i bookmark lots of references on delicious, occasionally tweet things, post random tidbits on tumblr, and share photos on flickr.

latest tips

my latest geek tips, also available on twitter, tumblr, or as plain text:

apache applescript awk bash c c++ consumer css cu cvs debian emacs excel firefox flickr gcc gentoo git gmail google grammar graphviz hadoop html idvd imagemagick iphone iphoto ipod irc itunes java javascript keynote latex lifehack linux mac macosx matlab meta mobile mturk mysql network networking perl php python quicktime razr rss rstats rsync ruby safari sed sge shell silverlight sms sql ssh svn test trac treo unix video windows word wordpress x11 xml macosx: set java environment using export JAVA_HOME=`/usr/libexec/java_home` http://bit.ly/2eXuGRf latex: use detex doc.tex | wc -w or texcount doc.tex to count words in a rendered document http://bit.ly/2eIRK65 rstats: new ways to hide part of a legend: scale_size(guide = F) or guides(size = F) http://bit.ly/2eXS1CI rstats: simple ways to sum across rows or columns with dplyr http://bit.ly/2e5QDwS rstats: use the zoo package for (irregular) time series http://bit.ly/2d66xXG