Synopsis
Data Skeptic is a data science podcast exploring machine learning, statistics, artificial intelligence, and other data topics through short tutorials and interviews with domain experts.
Episodes
-
ML Ops in Production
06/10/2022 Duration: 41minMoses Guttman from Clear ML joins us to share insights about how organizations leveraging machine learning keep their programs on track. While many parallels exist between the software development life cycle (SWLC) and the machine learning development life cycle, successful deployments of ML in production have demonstrated that a unique set of tools is required. Moses and I discuss the emergence of ML Ops, success stories, and how modern teams leverage tools like Clear ML's open source solution to maximize the value of ML in the organization.
-
Ad Network Tomography
03/10/2022 Duration: 35minData sharing in the ad tech space has largely been a black box system. While it is obvious the data is being collected, the data sharing process is obscure to users. On the show today, Maaz Bin Musa and Rishab, both researchers at the University of Iowa, speak about the importance of data transparency and their tool, ATOM for data transparency. Listen to find out how ATOM uncovers data-sharing relationships in the ad-tech space.
-
First Party Tracking Cookies
26/09/2022 Duration: 35minWhen you accept cookies on a website, you cannot tell whether the cookies are used for tracking your personal data or not. Shaoor Munir’s machine learning model does that. On the show today, the Ph.D student at the University of California, discussed the world of first-party cookies and how he developed a machine learning model that predicts whether a first-party cookie is used for tracking purposes.
-
The Harms of Targeted Weight Loss Ads
19/09/2022 Duration: 34minLiza Gak, a Ph.D. student at UC Berkeley, joins us to discuss her research on harmful weight loss advertising. She discussed how weight loss ads are not fact-checked, and how they typically target the most vulnerable. She extensively discussed her interview process, data analysis, and results. Listen for more!
-
Podcast Advertising
12/09/2022 Duration: 35minGrowing your podcast to the point of monetization is not a walk in the park. Today, Rob Walch, the VP of Podcast Relations at Libsyn talks about podcast advertising. He discussed how advertising works, how to grow your audience and some blueprints to being a successful podcaster. Listen for more.
-
Fairness in e-Commerce Search
05/09/2022 Duration: 40minWhen we search for products in e-commerce stores, we do not care what goes on under the hood to generate the results. However, there may be an intentional algorithmic effort to gravitate us toward a particular product. On the show, today, Abhisek Dash and Saptarshi Ghosh discuss their research on fairness in the search result of Amazon smart speakers.
-
Fraudulent Amazon Reviewers
29/08/2022 Duration: 41minChances are that you have bought a product online majorly because of the reviews you saw. Unfortunately, not all reviews are genuine. Today, Rajvardhan Oak shares some insight from his research on fraudulent Amazon reviews. He explained the inner workings of fraudulent reviews and revealed key insights from his qualitative and quantitative study.
-
Ad Targeting in Amazon Smart Speakers
22/08/2022 Duration: 32minWhile we give attention to textual data on the web, many do not know the unique power of echo interactions with smart devices for ad targeting. Today, our guest, Umar Iqbal joins us to discuss his study on using Amazon Smart Speakers for ad targeting. He gave interesting revelations about how voice data is captured and analysed for ad purposes. Listen to find out more.
-
Adwords with Unknown Budgets
15/08/2022 Duration: 34minRajan Udwani, an Assistant Professor at the University of California Berkeley joins us to discuss his work on AdWords with unknown budgets. He discussed the previous approaches to ad allocation, as well as his maiden approach that introduced randomization for better results. Listen for more.
-
ML Ops Best Practices
12/08/2022 Duration: 30minToday, we are joined by Piotr Niedźwiedź, Founder and CEO of Neptune.ai. Piotr discusses common MLOps activities by data science teams and how they can take advantage of Neptune.ai for better experiment tracking and efficiency. Listen for more!
-
Affiliate Marketing Rabbithole
08/08/2022 Duration: 52minAffiliate marketing creates an opportunity for marketers to gain a commission by promoting a product or service. Cookies are typically used for tracking and the advertiser whose product or service is being featured pays the marketing only on transactions. Today's episode covers those approaches and is also a story of conflict between two large companies and how one affiliate marketer got caught in the middle.
-
Monetization of Youtube Conspiracy Theorists
01/08/2022 Duration: 54minCameron Ballard joins us today to discuss his work around YouTube conspiracy theories. He revealed interesting observations about conspiracy theories on YouTube including how predatory ads are most common in conspiracy theory videos and how YouTube’s algorithm subtly works for predatory ads.
-
User Perceptions of Problematic Ads
25/07/2022 Duration: 37minEric Zeng joins us to discuss his study around understanding bad ads and efforts that can be taken to limit bad ads online. He discussed how he and his co authors scrapped a large amount of ad data, applied a machine learning algorithm, and commensurate statistical results.
-
Political Digital Advertising Analysis
21/07/2022 Duration: 35minNaLette Brodnax, a political scientist and an Assistant Professor in the McCourt School of Public Policy at Georgetown University joins us to discuss her work on analyzing digital advertisements for political campaigns. She used data for electoral campaigns on Facebook to answer questions that help us better understand how digital ads affect the outcome of elections. Click here for additional show notes! Thanks to our sponsor! https://neptune.ai/ Log, store, query, display, organize and compare all your model metadata in a single place
-
-
-
Privacy Preference Signals
04/07/2022 Duration: 33minHave you ever wondered what goes on under the hood when you accept a website’s cookies? Today, Maximilian Hils, a PhD student in Computer Science, at the University of Innsbruck, Austria, dissects the ad tech industry and the standards put in place to protect users’ data. He also shares his thoughts on the use of VPNs as well as other tools that help shield your data from prying eyes on the internet. Click here for additional show notes Thanks to our sponsor: https://clear.ml/ ClearML is an open-source MLOps solution users love to customize, helping you easily Track, Orchestrate, and Automate ML workflows at scale.
-
Neural Architecture Search for CTR Prediction
27/06/2022 Duration: 28minRavi Krishna joins us today to talk about his recent work on a differentiable NAS framework for ads CTR prediction. He discussed what CTR prediction is about and why his NAS framework helps in building neural networks for better ads recommendation. Listen to learn about methodology, related literature and his results. Click for additional show notes Thanks to our sponsor: https://astrato.io Astrato is a modern BI and analytics platform built for the Snowflake Data Cloud. A next-generation live query data visualization and analytics solution, empowering everyone to make live data decisions.
-
Algorithmic PPC Management
21/06/2022 Duration: 43minEffectively managing a large budget of pay per click advertising demands software solutions. When spending multi-million dollar budgets on hundreds of thousands of keywords, an effective algorithmic strategy is required to optimize marketing objectives. In this episode, Nathan Janos joins us to share insights from his work in the ad tech industry. Click for additional show notes Thanks to our sponsor! https://wandb.com/ The developer-first MLOps platform. Build better models faster with experiment tracking, dataset versioning, and model management.
-
Data Skeptic: Ad Tech
18/06/2022 Duration: 42minIncreasingly, people get most if not all of the information they consume online. Alongside the web sites, videos, apps, and other destinations, we’re consistently served advertisements alongside the organic content we search for or discover. Targetted ads make it possible for you to discover relevant new products you might otherwise not have heard about. Targetting can also open a pandora’s box of ethical considerations. Online advertising is a complex network of automated systems. Algorithms controlling algorithms controlling what we see. This season of Data Skeptic will focus on the applications of data science to digital advertising technology. In this first episode in particular, Kyle shares some of his own personal experiences and insights working in pay-per-click marketing. Click for additional show notes