Overview
Alias employs advanced text similarity algorithms to detect duplicate and near-duplicate responses. This helps identify participants who may be submitting copy-pasted or templated responses to multiple questions or across different surveys. There are two types of duplicate checks performed by Alias:- Self-duplicate
- Cross-duplicate
Self-duplicate
Self-duplicate detection identifies cases where a participant submits similar or identical responses to multiple questions within the same survey. Example:- Question 1: “Describe your typical day.”
- Response 1: “Yoga, healthy eating, focused work, regular breaks, evening reading.”
- Question 2: “What does your daily routine entail?”
- Response 2: “Healthy eating, focused work, regular breaks, and evening reading.”
Cross-duplicate
Cross-duplicate detection identifies cases where multiple participants submit similar or identical responses to the same question across different submissions. Example:- Question: “What drives you professionally?”
- Participant 1 Response: “Challenge, learning, innovation, making an impact.”
- Participant 2 Response: “Challenge + learning + innovation + having an impact.”
Response Grouping
In addition to flagging duplicate responses, Alias clusters similar responses into groups. Theresponse_groups object in the API response assigns an integer to each question, representing the cluster its response belongs to.
For example:
Interpreting Duplicate Detection Results
When a duplicate is detected, the corresponding question’s response is flagged in thechecks object of the API response.
For example:
Next Steps
- Learn how behavioral tracking can help detect fraudulent activity.
- Understand how effort scoring provides a more nuanced assessment of response quality.
- Explore the API reference for details on the
checksandresponse_groupsobjects.

