The Stack Overflow Podcast - From search trees to neural nets, a deep dive into natural language processing

We chatted with three guests:

Miguel Jetté: Head of AI R&D

Josh Dong: AI Engineering Manager

Jenny Drexler: Senior Speech Scientist

When Jette was studying mathematics in the early 2000s, his focus was on computational biology, and more specifically, phylogenetic trees, and DNA sequences. He wanted to understand the evolution of certain traits and the forces that explain why our bones are a certain length or our brains a certain size. As it turned out, the algorithms and techniques he learned in this field mapped very well to the emerging discipline of automatic speech recognition, or ASR. 

During this period, Montreal was emerging as a hotbed for artificial intelligence, and Jette found himself working for Nuance, the company behind the original implementation of Siri. That experience led him to several positions in the world of speech recognition, and he eventually landed at Rev, where he founded the company’s AI department. 

Jette describes Rev as an “Uber for Transcription.” Anyone can sign up for the platform and earn money by listening to audio submitted by clients and transcribing the speech into text. This means the company has a tremendous dataset of raw audio that has been annotated by human beings and, in many cases, assessed a second time by the client. For someone looking to build an AI system that mastered the domain of speech to text, this was a goldmine. 

Jette built the earliest version of Rev’s AI, but it was up to our second guest, Josh Dong, to productize and scale that system. He helped the department transition from older technologies like Perl to more popular languages like Python. He also focused on practical concerns like modularity and reusable components. To combine machine learning and DevOps, Dong added Docker containers and a testing pipeline. If you’re interested in the nuts and bolts of keeping a system like Rev’s running at tremendous scale, you’ll want to check out this part of the show. 

We also explore some of the fascinating future and promise this technology holds in our time with Jenny Drexler. She explains how Rev is moving from a hybrid model—one that combines Jette’s older statistical techniques with Dong’s newer machine learning approach—to a new system that will be ML from end-to-end. This will open up the door for powerful applications, like a single system that can convert speech text across multiple languages in a single piece of audio. 

“One of the things that's really cool about these end to end models is that basically, whatever data you have, it can learn to handle it. So a very similar architecture can do sequence to sequence learning with different kinds of sequences. The model architecture that you might use for speech recognition can actually look very similar to what you might use for translation. And you can use that same architecture, to say, feed in audio in lots of different languages and be able to do transcription for any of them within one model. It's much harder with the hybrid models to sort of put all the right pieces together to make that happen,” explains Drexler.

If you’re interested in learning more about the past, present, and future of artificial intelligence that can understand our spoken language and learn how to respond, check out the full episode. If you want to learn more about Rev or check out some of the positions they have open, you can find their careers page here.

The Stack Overflow Podcast - Tickets please! Exploring the joys of being a junior engineer

Bligh explains her love for front end and the simple pleasure of bringing a designer’s vision to life

We also talk about making the transition from journalism and digital media to the world of software development. 

You can find her on Twitter here.

You can check out Contact here.

Learn more about Makers here.

Our lifeboat badge winner of the week is Rami Amro Ahmed, who answered the question: What is the difference between Model Factory and a DB seeder in Laravel?

The Stack Overflow Podcast - Information foraging: the tricks great developers use to find solutions

You can check out some more of Henley's work on his blog here. Recent pieces include: 

How much time does the average developer spend typing in their editor versus researching, exploring, and pondering? Henley believes half an hour of inputting actual code a day is realistic, despite what you've heard about the 10X developer in your area. 

The Stack Overflow Podcast - A good software tutorial explains the How. A great one explains the Why.

Karl is interested in the use of low code tools to extend development work beyond the engineering department. He also believes this approach, when done properly, allows teams to release new iterations more rapidly.

Check out his company, draft.dev.

Follow him on Twitter or LinkedIn.

This week's lifefboat badge goes to Günter Zöchbauer, who explained: How to use 2 mixins in State in Flutter? 

The Stack Overflow Podcast - Don’t build it: advice on civic tech from MIT’s GOV/LAB

Innocent is  a research associate at the MIT Gov /Lab. You can find him on Twitter here.

Luke is the Founder and Executive Director of the civic technology organization Grassroot, as a practitioner-in-residence in 2021. You can follow him on Twitter here.

Our lifeboat of the week goes to John Rotenstein, who explained: Why some services are called “AWS XXX” and the others “Amazon XXX”.

 

The Stack Overflow Podcast - Unpacking observability and OpenTelemetry with Spiros Xanthos of Splunk

You can read more about Spiros on his LinkedIn or Twitter.

There is some good backstory on his first company, Log Insight, here. A rundown of the acquisition that led to Spiros joining Splunk is here. There are also some interesting details in Splunk's blog on the deal, which calls out Omnition as a "a stealth-mode SaaS company that is innovating in distributed tracing, improving monitoring across microservices applications."

If you enjoy the conversation and want to hear more, Spiros has done some interesting talks that are up on Youtube here.

Our lifeboat of the week goes to Willie Mentzel, who explains how to: Round Double to 1 decimal place in kotlin: from 0.044999 to 0.1.

 

The Stack Overflow Podcast - Blocking the haters as a service

Chou, a  Stanford educated computer scientist and electrical engineer, cut her teeth in Silicon Valley with stints at Facebook, Quora, and Pinterest, where she advocated for a stronger focus on diversity. 

Block Party describes its mission as building "anti-harassment tools against online abuse, but more fundamentally we are building solutions for user control, protection, and safety."

As CEO and lead engineer, Chou gets to choose the company's tools. Block Party is built with technologies like Render, Flask, and Jinja. Paul is very jealous of this stack.

Our lifeboat badge winner of the week is Bryan Oakley, who answered the question: How to redirect print statements to Tkinter text widget?

 

 

The Stack Overflow Podcast - Build engineering at Apple and the future of deploy previews

Eric was a build engineer at Apple for many years, then started a FeaturePeek which went through Y-combinator. He talks about what he learned from those experiences and how he'll be applying that knowledge to his new job at Netlify.

The teams combined forces to make the process of submitting and gathering feedback on deploy previews easier and more broadly accessible outside technical teams. As Cassidy explained:

“Based on technology from FeaturePeek, Deploy Previews enables reviewers to comment, screen record, and annotate right from the actual preview link. No new tabs. No new tools. Everyone’s feedback is recorded back in the GitHub pull request and can even extend to popular productivity tools such as Clubhouse.io, Linear, and Trello.”

This feature set is near and dear to Ben’s heart. Now folks from marketing and design can offer feedback and be more tightly involved in the development process for new features, products, and websites. All without really learning Git! 

Also discussed this episode: weirdware, workflow automation, Jerry Garcia, compound bows, and the spread of Git and branch methodology to areas well outside software development.