Clio: A privacy-preserving system that provides insights into real-world AI usage

By bella December 13, 2024

What are people using AI models for? Despite the rapid growth in popularity of large language models, we still know very little about what they are used for.

This isn’t just out of curiosity, or even sociological research. Understanding how people actually use language models is important for security reasons: providers put a lot of effort into pre-deployment testing and use trust and safety systems to prevent misuse. But the scale and diversity of language models makes it hard to understand what they are used for (not to mention any kind of comprehensive security monitoring).

There’s another key factor that prevents us from having a clear understanding of how AI models are used: privacy. At Anthropic, our Claude model is not trained on user conversations by default, and we take protecting our users’ data very seriously. So how do we study and observe how our systems are used while strictly protecting our users’ privacy?

Cl aude Insights and Observations (“Clio” for short) is our attempt to answer this question. Clio is an automated analytics tool that performs privacy-preserving analysis of real-world language model usage. It gives us insight into everyday usage at claude.ai in a similar way to tools like Google Trends. It’s already helping us improve our security measures. In this post (with the full research paper attached), we describe Clio and some of its initial results.

How Clio Works: Privacy-Preserving Analytics at Scale

Traditional top-down security approaches (such as assessments and red teams) rely on knowing what to look for in advance. Clio takes a different approach, enabling bottom-up pattern discovery by distilling conversations into abstract, understandable clusters of topics. It does this while protecting user privacy: data is automatically anonymized and aggregated, and only higher-level clusters are visible to human analysts.

All of our privacy protections are extensively tested, as described in our research paper.

How People Use Claude: Insights from Clio

Using Clio, we were able to gain insight into how people use claude.ai in practice. While public datasets such as WildChat and LMSYS-Chat-1M provide useful information about how people use language models, they only capture specific contexts and use cases. Clio gives us a glimpse into the full real-world usage of claude.ai (which may differ from other AI systems due to differences in user base and model type).

Summary of Clio’s analysis steps, illustrated using fictional examples of conversations.

Here’s a brief overview of Clio’s multi-stage process:

Extracting Aspects: For each conversation, Clio extracts multiple “aspects” — specific properties or metadata, such as the topic of the conversation, the number of back-and-forths in the conversation, or the language used.

Semantic Clustering: Similar conversations are automatically grouped based on topics or general themes.

Cluster Descriptions: Each cluster receives a descriptive title and summary that captures common themes from the raw data while excluding private information.

Building Hierarchies: Clusters are organized into multi-level hierarchies for easier exploration. They can then be presented in an interactive interface that a human factors analyst can use to explore patterns along different dimensions (topics, language, etc.).

“Laptop Battery Online Shop- Replacement Laptop Battery, Cell phone Battery, Tablet Battery, Power Tools Battery Wholesale and Retailer in United Kingdom. All products are brand new with one year warranty and factory price!”

Clio: A privacy-preserving system that provides insights into real-world AI usage

By bella

Leave a Reply Cancel reply

You Missed

Spectrum AI broadens scope of pivotal study of AI wound platform

Sony WF-C510 details released

Lenovo ThinkBook Plus Gen 5 Hybrid review: Lenovo, please make more laptops like this

LG Watch Urbane 2nd Edition Android Wear smartwatch with LTE announced

By bella

Related Post

Leave a Reply Cancel reply

You Missed