This morning I had the opportunity to chat with software engineers and data scientists at the AI Dev World Conference on a topic I just happen to be v...
One of our engineering goals this year at Pinpoint is to improve the entire team’s knowledge of the methodologies and tools available to us as software engineers, data engineers and data scientists. We have found some success in having team members attend conferences related to their roles, then share their takeaways with the team when they return. This regularly exposes us to new ideas and new tools, and steadily improves our overall knowledge.
At Pinpoint, we use R for most of our data science work, from data exploration and data cleaning to modeling and testing. To help improve how we use R at Pinpoint, a colleague and I attended the RStudio Conference in January. My goal was both to pick up concrete tips and tricks for using R day to day and to see how other companies are using R. I was particularly interested in how R is used in large-scale projects and how companies scale their data science applications with it, so we could apply those strategies as our own applications grow. I attended several talks by people using R for serious business cases and quickly noticed a theme: most of these companies create an API using the plumber package and deploy containerized versions of their R programs as microservices.
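To make the plumber pattern concrete, here is a minimal sketch in R. It is not code from any of the talks or from Pinpoint: the handler name `score_handler`, the `/score` route, the toy "model" and port 8000 are all assumptions chosen for illustration.

```r
library(plumber)

# Hypothetical handler standing in for a real model.
# Query-string parameters arrive as character strings, so convert first.
score_handler <- function(x = "0") {
  x <- as.numeric(x)
  list(input = x, score = x * 2)  # toy "model": doubles the input
}

# Build an empty router and register the handler on GET /score.
api <- pr_get(pr(), "/score", score_handler)

# Starting the server blocks the R session, so it is left commented out here.
# Inside a container this would typically be the last line of the script:
# pr_run(api, host = "0.0.0.0", port = 8000)
```

With the last line uncommented, a request such as `GET /score?x=3` would be routed to `score_handler`. Binding to `0.0.0.0` rather than the default localhost is what lets the port be published from inside a container, which is how a script like this could run as a microservice.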
Even as a regular R user, I still have a lot to learn about all of the R ecosystem and its capabilities. Here are my takeaways and highlights from the conference.
My goal was to expand my knowledge of the R ecosystem. The conference was a great way to learn about R packages I hadn’t heard of or used, and to determine whether any of them should be incorporated into our process.
These are some of the new packages I discovered at the conference:
The Tidyverse also continues to improve, with bug fixes and new features added regularly. At this point it is an essential tool for anyone using R for data science.
The talks I enjoyed the most focused on the intersection of machine learning and human intelligence. Humans design machine learning algorithms as well as the organizations that use artificial intelligence. The human element is often missed in discussions about artificial intelligence, from the ways debugging is handled (by humans) to how we can prevent bias in machine learning. Here were my favorite talks.
I’ve already started using some of these new techniques and tools in my work at Pinpoint as I share them with the rest of the team. Ultimately, they will make our product more resilient, easier to debug and more transparent for developers and managers.