Pachyderm, Provenance, Data Lakes

Go Time: Golang, Software Engineering ·

Joe Doliner joined the show to talk about managing data lakes with Pachyderm, data containers, provenance, and other interesting Go projects and news.

Join the discussion (https://changelog.zulipchat.com/#narrow/stream/455709-gotime) Changelog++ (https://changelog.com/++) members support our work, get closer to the metal, and make the ads disappear. Join today! Sponsors:

• Linode (https://linode.com/changelog) – Our cloud server of choice. Get one of the fastest, most efficient SSD cloud servers for only $5/mo. Use the code changelog2017 to get 4 months free!

• Fastly (https://www.fastly.com/?utm_source=changelog&utm_medium=podcast&utm_campaign=changelog-sponsorship) – Our bandwidth partner. Fastly powers fast, secure, and scalable digital experiences. Move beyond your content delivery network to their powerful edge cloud platform.

• Toptal (https://toptal.com/go?utm_source=changelog&utm_medium=podcast&utm_campaign=changelog-sponsorship) – Scale your team and hire from the top 3% of developers and designers with Toptal. Email adam@changelog.com for a personal introduction.

• Backtrace (https://www.backtrace.io/gotime) – Reduce your time to resolution. Go beyond stacktraces and logs. Get to the root cause quickly with deep application introspection at your fingertips.

Featuring:

• Joe Doliner – Website (http://joedoliner.com/), GitHub (https://github.com/jdoliner), X (https://x.com/jdoliner) • Erik St. Martin – GitHub (https://github.com/erikstmartin), X (https://x.com/erikstmartin) • Carlisia Thompson – GitHub (https://github.com/carlisia), LinkedIn (https://www.linkedin.com/in/carlisia), X (https://x.com/carlisia) • Brian Ketelsen – GitHub (https://github.com/bketelsen), X (https://x.com/bketelsen)

Show Notes: Pachyderm.io (https://www.pachyderm.io/)

Let’s build a modern Hadoop (https://medium.com/pachyderm-data/lets-build-a-modern-hadoop-4fc160f8d74f#.mkof29jw7)

Putting the science back in data science (https://www.oreilly.com/ideas/putting-the-science-back-in-data-science)

Martin Fowler - DataLake (https://martinfowler.com/bliki/DataLake.html)

Wikipedia: Data Lake (https://en.wikipedia.org/wiki/Data_lake)

Provenance: the Missing Feature for Rigorous Data Science. Now in Pachyderm 1.1 (https://medium.com/pachyderm-data/provenance-the-missing-feature-for-good-data-science-now-in-pachyderm-1-1-2bd9d376a7eb#.ti3iqat9z)

xkcd: Who were you DenverCoder9? What did you see?! (https://xkcd.com/979/)

Pachyderm Users Slack Channel (https://pachyderm-users.slack.com/)

Interesting Go Projects and News GitLab.com Database Incident - 2017/01/31 (https://docs.google.com/document/d/1GCK53YDcBWQveod9kfzW-VCxIABGiryG7_z_6jHdVik/pub)

Changelog Spotlight #8: Conversational Development and Controversy with Sid Sijbrandij (https://changelog.com/spotlight/8)

Wuzz (visual cURL) (https://github.com/asciimoo/wuzz)

Ozzo Validation (https://github.com/go-ozzo/ozzo-validation)

dep 101 - I Can Haz Downtime? (https://medium.com/i-can-haz-downtime/dep-101-c85e8ab6ed45#.o1tzfxijv)

The State of Go - February 2017 (https://www.youtube.com/watch?v=tY4UKkgb5IY)

Free Software Friday! Each week on the show we give a shout out to an open source project or community that’s made an impact in our day to day developer lives.

• Brian - NATS (https://nats.io)

• Erik - hashcat (https://hashcat.net/hashcat/)

• Carlisia - Hashicorp Vault (https://www.vaultproject.io/)

• Joe - grpc (https://github.com/orgs/grpc/people)

Something missing or broken? PRs welcome! (https://github.com/thechangelog/show-notes/blob/master/gotime/go-time-34.md)