Data Skeptic

Data Skeptic

Author: Vários
Narrator: Vários
Publisher: Podcast
Duration: 316:03:14

More information

Synopsis

Data Skeptic is a data science podcast exploring machine learning, statistics, artificial intelligence, and other data topics through short tutorials and interviews with domain experts.

Show more

Episodes

Fairness in e-Commerce Search

05/09/2022 Duration: 40min

When we search for products in e-commerce stores, we do not care what goes on under the hood to generate the results. However, there may be an intentional algorithmic effort to gravitate us toward a particular product. On the show, today, Abhisek Dash and Saptarshi Ghosh discuss their research on fairness in the search result of Amazon smart speakers.

Listen

Listen
Fraudulent Amazon Reviewers

29/08/2022 Duration: 41min

Chances are that you have bought a product online majorly because of the reviews you saw. Unfortunately, not all reviews are genuine. Today, Rajvardhan Oak shares some insight from his research on fraudulent Amazon reviews. He explained the inner workings of fraudulent reviews and revealed key insights from his qualitative and quantitative study.

Listen

Listen
Ad Targeting in Amazon Smart Speakers

22/08/2022 Duration: 32min

While we give attention to textual data on the web, many do not know the unique power of echo interactions with smart devices for ad targeting. Today, our guest, Umar Iqbal joins us to discuss his study on using Amazon Smart Speakers for ad targeting. He gave interesting revelations about how voice data is captured and analysed for ad purposes. Listen to find out more.

Listen

Listen
Adwords with Unknown Budgets

15/08/2022 Duration: 34min

Rajan Udwani, an Assistant Professor at the University of California Berkeley joins us to discuss his work on AdWords with unknown budgets. He discussed the previous approaches to ad allocation, as well as his maiden approach that introduced randomization for better results. Listen for more.

Listen

Listen
ML Ops Best Practices

12/08/2022 Duration: 30min

Today, we are joined by Piotr Niedźwiedź, Founder and CEO of Neptune.ai. Piotr discusses common MLOps activities by data science teams and how they can take advantage of Neptune.ai for better experiment tracking and efficiency. Listen for more!

Listen

Listen
Affiliate Marketing Rabbithole

08/08/2022 Duration: 52min

Affiliate marketing creates an opportunity for marketers to gain a commission by promoting a product or service. Cookies are typically used for tracking and the advertiser whose product or service is being featured pays the marketing only on transactions. Today's episode covers those approaches and is also a story of conflict between two large companies and how one affiliate marketer got caught in the middle.

Listen

Listen
Monetization of Youtube Conspiracy Theorists

01/08/2022 Duration: 54min

Cameron Ballard joins us today to discuss his work around YouTube conspiracy theories. He revealed interesting observations about conspiracy theories on YouTube including how predatory ads are most common in conspiracy theory videos and how YouTube's algorithm subtly works for predatory ads.

Listen

Listen
User Perceptions of Problematic Ads

25/07/2022 Duration: 37min

Eric Zeng joins us to discuss his study around understanding bad ads and efforts that can be taken to limit bad ads online. He discussed how he and his co authors scrapped a large amount of ad data, applied a machine learning algorithm, and commensurate statistical results.

Listen

Listen
Political Digital Advertising Analysis

21/07/2022 Duration: 35min

NaLette Brodnax, a political scientist and an Assistant Professor in the McCourt School of Public Policy at Georgetown University joins us to discuss her work on analyzing digital advertisements for political campaigns. She used data for electoral campaigns on Facebook to answer questions that help us better understand how digital ads affect the outcome of elections. Click here for additional show notes! Thanks to our sponsor! https://neptune.ai/ Log, store, query, display, organize and compare all your model metadata in a single place

Listen

Listen
Fraud Detection in Crowdfunding Campaigns

18/07/2022 Duration: 35min

Listen

Listen
Artificial Intelligence and Auction Design

11/07/2022 Duration: 43min

Listen

Listen
Privacy Preference Signals

04/07/2022 Duration: 33min

Have you ever wondered what goes on under the hood when you accept a website's cookies? Today, Maximilian Hils, a PhD student in Computer Science, at the University of Innsbruck, Austria, dissects the ad tech industry and the standards put in place to protect users' data. He also shares his thoughts on the use of VPNs as well as other tools that help shield your data from prying eyes on the internet. Click here for additional show notes Thanks to our sponsor: https://clear.ml/ ClearML is an open-source MLOps solution users love to customize, helping you easily Track, Orchestrate, and Automate ML workflows at scale.

Listen

Listen
Neural Architecture Search for CTR Prediction

27/06/2022 Duration: 28min

Ravi Krishna joins us today to talk about his recent work on a differentiable NAS framework for ads CTR prediction. He discussed what CTR prediction is about and why his NAS framework helps in building neural networks for better ads recommendation. Listen to learn about methodology, related literature and his results. Click for additional show notes Thanks to our sponsor: https://astrato.io Astrato is a modern BI and analytics platform built for the Snowflake Data Cloud. A next-generation live query data visualization and analytics solution, empowering everyone to make live data decisions.

Listen

Listen
Algorithmic PPC Management

21/06/2022 Duration: 43min

Effectively managing a large budget of pay per click advertising demands software solutions. When spending multi-million dollar budgets on hundreds of thousands of keywords, an effective algorithmic strategy is required to optimize marketing objectives. In this episode, Nathan Janos joins us to share insights from his work in the ad tech industry. Click for additional show notes Thanks to our sponsor! https://wandb.com/ The developer-first MLOps platform. Build better models faster with experiment tracking, dataset versioning, and model management.

Listen

Listen
Data Skeptic: Ad Tech

18/06/2022 Duration: 42min

Increasingly, people get most if not all of the information they consume online. Alongside the web sites, videos, apps, and other destinations, we're consistently served advertisements alongside the organic content we search for or discover. Targetted ads make it possible for you to discover relevant new products you might otherwise not have heard about. Targetting can also open a pandora's box of ethical considerations. Online advertising is a complex network of automated systems. Algorithms controlling algorithms controlling what we see. This season of Data Skeptic will focus on the applications of data science to digital advertising technology. In this first episode in particular, Kyle shares some of his own personal experiences and insights working in pay-per-click marketing. Click for additional show notes

Listen

Listen
The Reliability of Mobile Phone Data

13/06/2022 Duration: 49min

Our mobile phones generate an incredible amount of data inbound and outbound. In today's episode, Nishant Kishore, a PhD graduate of Harvard University in Infectious Disease Epidemiology, explains how mobility data from mobile phones can be captured and analysed to understand the spread of infectious diseases. Click here for additional show notes Thanks to our sponsor! https://neptune.ai/ Log, store, query, display, organize, and compare all your model metadata in a single place

Listen

Listen
Haywire Algorithms

06/06/2022 Duration: 33min

The pandemic changed how we lived. And this had a ripple effect on the performance of machine learning models. Ravi Parikh joins us today to discuss how the pandemic has affected the performance of machine learning models in clinical care and some actionable steps to fix it. Click here for additional show notes Thanks to our sponsor: Astera Centerprise is a no-code data integration platform that allows users to build ETL/ELT pipelines for modern data warehousing and analytics.

Listen

Listen
School Reopening Analysis

30/05/2022 Duration: 33min

Carly Lupton-Smith joins us today to speak about her research which investigated the consistency between household and county measures of school reopening. Carly is a doctoral researcher in Biostatistics at Johns Hopkins Bloomberg School of Public Health. Listen to know about her findings. Click here for additional show notes on our website! Thanks to our sponsor!ClearML is an open-source MLOps solution users love to customize, helping you easily Track, Orchestrate, and Automate ML workflows at scale. Astera Centerprise is a no-code data integration platform that allows users to build ETL/ELT pipelines for modern data warehousing and analytics.

Listen

Listen
Modern Data Stacks

26/05/2022 Duration: 34min

Today, we are joined by Alexander Thor, a Product Manager at Vizlib, makers of Astrato. Astrato is a data analytics and business intelligence tool built on the cloud and for the cloud. Alexander discusses the features and capabilities of Astrato for data professionals. Visit our website for additional show notes!

Listen

Listen
Emoji as a Predictor

23/05/2022 Duration: 21min

Emojis are arguably one of the most effective ways to express emotions when texting. In today's episode, Xuan Lu shares her research on the use of emojis by developers. She explains how the study of emojis can track the emotions of remote workers and predict future behavior. Listen to find out more!

Listen

Listen

|<
<<
>>
>|

page 9 from 31