Getting Started with Microsoft’s Presidio: A Step-by-Step Guide to Detecting and Anonymizing Personally Identifiable Information PII in Text

Getting Started with Microsoft’s Presidio: A Step-by-Step Guide to Detecting and Anonymizing Personally Identifiable Information PII in Text

In this tutorial, we will explore how to use Microsoft’s Presidio, an open-source framework designed for detecting, analyzing, and anonymizing personally identifiable information (PII) in free-form text. Built on top of the efficient spaCy NLP library, Presidio is both lightweight and modular, making it easy to integrate into real-time applications and pipelines. We will cover…

Read More
LLMs factor in unrelated information when recommending medical treatments

LLMs factor in unrelated information when recommending medical treatments

A large language model (LLM) deployed to make treatment recommendations can be tripped up by nonclinical information in patient messages, like typos, extra white space, missing gender markers, or the use of uncertain, dramatic, and informal language, according to a study by MIT researchers. They found that making stylistic or grammatical changes to messages increases…

Read More
Teaching Mistral Agents to Say No: Content Moderation from Prompt to Response

Teaching Mistral Agents to Say No: Content Moderation from Prompt to Response

In this tutorial, we’ll implement content moderation guardrails for Mistral agents to ensure safe and policy-compliant interactions. By using Mistral’s moderation APIs, we’ll validate both the user input and the agent’s response against categories like financial advice, self-harm, PII, and more. This helps prevent harmful or inappropriate content from being generated or processed — a…

Read More
Celebrating an academic-industry collaboration to advance vehicle technology

Celebrating an academic-industry collaboration to advance vehicle technology

On May 6, MIT AgeLab’s Advanced Vehicle Technology (AVT) Consortium, part of the MIT Center for Transportation and Logistics, celebrated 10 years of its global academic-industry collaboration. AVT was founded with the aim of developing new data that contribute to automotive manufacturers, suppliers, and insurers’ real-world understanding of how drivers use and respond to increasingly…

Read More
Building Event-Driven AI Agents with UAgents and Google Gemini: A Modular Python Implementation Guide

Building Event-Driven AI Agents with UAgents and Google Gemini: A Modular Python Implementation Guide

In this tutorial, we demonstrate how to use the UAgents framework to build a lightweight, event-driven AI agent architecture on top of Google’s Gemini API. We’ll start by applying nest_asyncio to enable nested event loops, then configure your Gemini API key and instantiate the GenAI client. Next, we’ll define our communication contracts, Question and Answer…

Read More
Build an Intelligent Multi-Tool AI Agent Interface Using Streamlit for Seamless Real-Time Interaction

Build an Intelligent Multi-Tool AI Agent Interface Using Streamlit for Seamless Real-Time Interaction

In this tutorial, we’ll build a powerful and interactive Streamlit application that brings together the capabilities of LangChain, the Google Gemini API, and a suite of advanced tools to create a smart AI assistant. Using Streamlit’s intuitive interface, we’ll create a chat-based system that can search the web, fetch Wikipedia content, perform calculations, remember key…

Read More
How to Build an Advanced BrightData Web Scraper with Google Gemini for AI-Powered Data Extraction

How to Build an Advanced BrightData Web Scraper with Google Gemini for AI-Powered Data Extraction

In this tutorial, we walk you through building an enhanced web scraping tool that leverages BrightData’s powerful proxy network alongside Google’s Gemini API for intelligent data extraction. You’ll see how to structure your Python project, install and import the necessary libraries, and encapsulate scraping logic within a clean, reusable BrightDataScraper class. Whether you’re targeting Amazon…

Read More