Idea Graveyard: Ground Truth Aggregation

[Note: this post is the latest in my series of "startups that weren't." You can read more about other ideas I've (for now!) put in the idea graveyard.] Not too long ago, quant research strategies employed by hedge funds were one of the few places time series data was analyzed at scale. But with the flood …

Statistical Insignificance, Graphic Novel Edition

For those living under a rock [Ed: note irony, as it's doubtful this is on your mind], you may be unaware of the tremendous controversy brewing in academic circles on the topic of reproducibility of published research. For those who think this may be a silly intellectual argument, the truth is quite alarming. The whole …

“Succulent” Tuna Fish

Simple project Number Two entailed refashioning empty tins of tuna for the beta version below. After pushing this to prod I immediately realized how to improve. After the pic for the step-by-step caption-driven DIY tutorial! What to do with tinned fish containers apart from toss in the recycle bin? Why, use as a foundation for …

Lazy DIY: From Fire Escape to ‘Garden’

I recently moved into my new apartment in Bed Stuy and instantly knew how to transform an ugly, unusable fire escape into something playful, functional and maybe even a bit ironic. My skills in craftiness are more conceptual, but this was pretty straightforward and results were as I expected/hoped. You'll need a tape measure, utility knife or …

When Data Science Alone Won’t Cut it: Deriving Signal from Observations in the Maritime Domain

I recently read an article (paywall) in the WSJ about Paul Allen’s Vulcan initiative to curb illegal fishing. It's insightful and sheds light on Big Data techniques to address societal problems. After thinking on the story, it struck me that it could be used as a pedagogical tool to synthesize data science with domain knowledge. …

You Say Predicting, I say Reporting

The more the world of Big Data/novel analytic techniques/machine learning is internalized, the greater the likelihood assumptions move to presumptions and technical terms unwittingly become marketing terms. The Gartner Hype Cycle is a great illustration. In the context of "predictive analytics," it's worth knowing what people actually mean. First, predictions--obviously the term is about the future, …

Idea Graveyard: Leadgen for Cloud Infrastructure

[This is the first in a series of posts about businesses that weren't for a variety of reasons. You can read about others in Projects and Ideas.] In October 2016 I spent a few months with my friend and former board member/employee Jake Baillie looking into building a service that would offer a cloud computing estimation/calculator to help …

Government Stats Are Ready for Change (Book Review)

For those of you similarly interested (obsessed?) with the changing role of government statistics relative to the explosion of highly dimensional private sector data, I recommend having a look at Innovations in Federal Statistics: Combining Data Sources While Protecting Privacy from the National Academy of Sciences. It's an easy read and offers a solid foundation for those who seek a …

Podcasts We Love!

The golden age of radio is upon us (again), this time with fantastic production value and oh so much choice. My top ten below for those who put a premium on commute time. Not an easy list to rank! The folks at Radiotopia are doing a bang-up job and it seems like they are just getting started. Le …

Talkin’ Geo

Thanks to James Fee for the reboot of Hangouts with James Fee, a geo-centric podcast. The backstory to this goes back close to ten years when Steve Coast started This Week In Maps, a sort of weekly roundtable recorded call (pre podcast, via Skype!) of some ole grumpies (Marc Prioleau, Di-Ann Eisnor, Steve and James) …

Proxy Indicators: beware of spurious claims

I recently stumbled across a research paper, Using Deep Learning and Google Street View to Estimate the Demographic Makeup of the US, which piqued my interest in derivative uses of data, an ongoing research interest of mine. A variety of deep learning techniques were used to draw conclusions about relationships of car ownership, political affiliation …

Fake Stats

With the recent spate of fake news (why can’t we just call them lies?), I started thinking about the growing chasm between statistical/fact creators and media consumers. Historically we have put our trust in the Fourth Estate to analyze, filter and present to an audience. For better or worse, today anybody can call themselves a journalist– …

Data-derived Products

I love creating data-derived products. I’ve been saying one man’s metadata is another’s data for years, and now that we are in a golden age of data brought on by cheap cloud storage/compute, sensors/devices everywhere and the rise of data scientists, the age of data is upon us. However, getting past the high fives and …

RFF: Quantifying Municipal Growth

I’ve been pursuing several ideas as an independent researcher– some scratch an intellectual itch and others have the makings of new opportunities that require further exploration before they can be commercialized. I call this a request for feedback (RFF). Part of the exploration is throwing ideas onto the Internet and seeing what comes back, so …

Prop Trading, Hedge Funds and Startups: Looking for Alpha in All the New Places

High-tech startups, hailing from Silicon Valley, and large, well-established banks, hailing from Wall St, couldn’t be more different. Yet the golden opportunity of FinTech holds the promise of (somehow) bringing them closer together. This post isn’t about disruption, executive pay, alternative credit scoring models or other topics du jour. It’s about a possible future where …

Can Silicon Valley Try Life in the Present, for a Minute?

Perhaps more accurately this should be titled “My Problem with Silicon Valley,” but I have a hunch I am not the only one who feels the region has become a caricature of itself, which is I suspect why many of us love the HBO series Silicon Valley– not because it’s funny, but because it’s eerily …

Freedom of Information Act at 50 and Newly Improved

For those of you who care about access to government data for reasons of transparency, feel-good openness, competitive advantage or other, June 30 was an overall ok/good day. Obama signed the FOIA Improvement Act of 2016 in an attempt to solidify his administration’s legacy as the most transparent in US history. Many are skeptical, including your author, …

Finally, Skepticism in Open Data

Over the past 10ish years I’ve often spoken critically of the hand-wavy open data love fest. Stumbled across an article today that does a great job of putting into context the rhetoric v reality. Centering on toilet usability data, Giuseppe Sollazzo makes a solid argument for rethinking the definition of value in opening data to the …

Derivative Data

Since selling Urban Mapping last year, I’ve spent more time thinking about how data can be and is used for alternative purposes. To me, the idea of packaging up organisational data exhaust and redirecting it to non-adjacent markets is an opportunity hidden in plain sight. I’ve been whining that one person’s metadata is another …

You Don’t Always Get What You (Pre) Pay For

I was excited to back a Kickstarter project for an IoT enabled sensor last spring. It arrived last month, seemed a little buggy (ok, a lot buggy), so I decided to give it a rest. Never got back to it, but this morning I received an email from the manufacturer indicating significantly degraded capabilities. I’m …