Navigating the minefield of AI algorithm deception

Is there a context in which algorithms could be designed to deceive, and what are the ethics of this? Where do we draw a line between a good lie and a bad lie, and what are the ethics of a good lie?

Artificial intelligence (AI) is facing an exciting challenge: the art of deception. As AI systems get more complex, their capacity to manipulate information while concealing their true objectives creates a new problem, blurring the distinction between machines and Machiavellian strategists.

Deception, in its most basic form, delivers false information to gain an advantage. In the field of AI, this might appear in various ways. Consider an AI trading bot that intentionally injects noise into market data to disguise its trading patterns.

Consider a self-driving automobile that deliberately swerves to avoid exposing its optimal route to a competitor. In both cases, the AI uses intentional deception to achieve its objectives. Such deception could provide new opportunities for strategic manoeuvring in competitive contexts.

In 2007, Evan Hurwitz and I observed an AI bot named Aiden deceiving while playing a game of poker without being primed to mislead and conceal. Our work delves into the intricate mechanics of how AI can learn on its own to deceive, a concept formerly reserved for the human intellect. This pushed the traditional limits of AI’s capabilities beyond logical computation, including human-like unpredictability and strategic ambiguity.

Our work not only advanced AI by pushing the boundaries of what AI can accomplish, but also prompted a thorough rethinking of the legal, ethical and practical aspects of AI systems capable of such complex behaviour as deception.

The ethical consequences of AI deceptions are extensive. Can we support deliberate lying, even in strategic contexts? Who is responsible for an AI bluffer’s actions? And how can we prevent such systems from abusing human trust and manipulating societal institutions for their own benefit?

One classic example is a person named Taku who appears in a village visibly carrying a gun and asking Thandi the whereabouts of Thuso, who he wants to punish severely. Should Thandi tell Taku where Thuso is, or does she lie and use the time while Taku pursues the lie to inform the police?

In this context, deception is justifiable from a utilitarian perspective because Taku may harm or kill Thuso without the lie. Is there a context in which algorithms could be designed to deceive, and what are the ethics of this? Where do we draw a line between a good lie and a bad lie, and what are the ethics of a good lie?

Countering damaging deception

It is essential to emphasise the importance of responsible AI development and deployment with deceptive capabilities.

One way of dealing with this issue is advocating for openness and explainability as potential protections, ensuring that AI systems can explain their thinking and reasons. This can create trust while mitigating the risks associated with ambiguous intelligence.

However, the dominant form of AI, deep learning, must still be sufficiently advanced to be explainable. However, the technological limitation of accuracy vs explainability tradeoff, where the more accurate the AI system, the less transparent it is, complicates this matter.

The findings by Hurwitz and myself have far-reaching ramifications beyond games and markets. In an increasingly AI-driven society, knowing the potential for algorithmic deception is critical in many industries. From cybersecurity and autonomous vehicles to political campaigns and social media networks, understanding the subtle signs of AI bluffing will be essential in negotiating the intricacies of a just human-machine interaction.

Beyond outright deception, AI can demonstrate strategic ambiguity. By leaving their behaviours open to interpretation, AI systems can create confusion and ambiguity, keeping their opponents guessing.

A chatbot, for example, may generate technically correct but purposefully ambiguous responses, leading humans astray. Similarly, an AI tasked with cybersecurity may deliberately leave vulnerabilities unpatched, producing a false sense of security while discreetly gathering intelligence.

Fortunately, AI has enormous potential as a weapon against its deceiving capability. One way is to examine data patterns for anomaly detection. Anomaly detection, a technique often used to find patterns that deviate from expected behaviour, provides a promising way to detect bluffing instances involving unusual or deceitful conduct.

In situations ranging from online gaming to essential business discussions, computers with anomaly detection algorithms can examine behavioural patterns, decision-making processes and communication styles, highlighting discrepancies or peculiarities that may suggest bluffing.

For example, an anomaly detection system could examine online poker betting patterns and playing styles to find variations that indicate a player is bluffing.

Similarly, slight linguistic alterations or systematic deviations from standard engagement patterns in corporate or diplomatic discussions could be interpreted as potential bluffs. Consider an AI trading agent suddenly departing from its regular risk profile, raising the alarm for possible market manipulation.

Understanding behaviours

Behavioural analysis is another helpful tool. AI systems, like people, can present signals when lying. Monitoring changes in data-gathering patterns, response timings, or internal decision-making processes can reveal departures from expected behaviour, implying intentional dishonesty. This application improves the ability to maintain fairness and integrity in various settings and provides new pathways for analysing and interpreting human behaviour using AI-driven analytics.

However, combatting AI deception will take more work. As AI systems get more advanced, their deception strategies will evolve accordingly. This implies a never-ending arms race in which humans constantly improve detection algorithms to stay up with deceitful AI’s ever-changing strategy.

Beyond the technical hurdles, ethical concerns are significant. Who can be trusted with the power of AI deceit detection? Who determines the parameters for detecting suspicious behaviour, and how do we prevent producing false positives that hinder actual AI innovation?

These questions necessitate thorough research and deliberate policy formulation to guarantee that this tool is used responsibly. The struggle against AI deception is not an existential conflict with computers, but rather a demand for responsible AI development. We must build AI with openness, accountability, and human oversight as its foundation.

By providing AI with deception-detection mechanisms and cultivating a culture of ethical AI research, we may shape a future in which robots empower rather than manipulate, and the arms race of deceit gives way to an era of collaborative intelligence for the benefit of all.

One such option is transparency. If we can create AI systems that behave strategically and explain their rationale, we can reduce the risks of deception and ambiguity. By exposing AI’s rational processes, we can hold it accountable for its acts and build trust between humans and machines.

However, perfect transparency may only sometimes be preferable. In some circumstances, revealing an AI’s genuine objectives may jeopardise its effectiveness. Striking a balance between strategic ambiguity and responsibility will be critical for navigating the ethical minefield of AI deception.

Finally, the rise of AI deceptions demands a new era of critical thinking. We must understand these intelligent machines’ activities with scepticism and alertness rather than taking them at face value. Understanding the potential for deception and ambiguity in AI allows us to better prepare for the complex ethical dilemmas that lie ahead.

The distinction between an innovative strategy and a manipulative scheme is usually narrow. As we enter the age of AI, let us create a future in which intelligence is driven by values of openness, accountability, and, ultimately, human well-being, and carefully navigate the opportunities and risks of designing AI with the ability to deceive. DM

Comments - Please login in order to comment.

Dennis Bailey says:

7 February 2024 at 06:46

But AI can’t stop this yobo from flaunting her wears on DM comments!

Log in to Reply
- Peter Geddes says:
  
  7 February 2024 at 09:57
  
  But AI could correct spelling errors, such as ‘wears’ instead of the correct ‘wares’.
  
  Log in to Reply

Cookie	Duration	Description
__cfduid	1 month	The cookie is used by cdn services like CloudFlare to identify individual clients behind a shared IP address and apply security settings on a per-client basis. It does not correspond to any user ID in the web application and does not store any personally identifiable information.
_GRECAPTCHA	5 months 27 days	This cookie is set by Google. In addition to certain standard Google cookies, reCAPTCHA sets a necessary cookie (_GRECAPTCHA) when executed for the purpose of providing its risk analysis.
cid	1 year	This is an important cookie in making credit card transaction on the website. It allows the online transaction without storing the credit card information.This service is provided by Stripe.com.
connect.sid	1 month	This cookie is used for authentication and for secure log-in. It registers the log-in information.
cookielawinfo-checkbox-advertisement	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Advertisement" category .
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
CookieLawInfoConsent	1 year	Records the default button state of the corresponding category & the status of CCPA. It works only in coordination with the primary cookie.
elementor	never	This cookie is used by the website's WordPress theme. It allows the website owner to implement or change the website's content in real-time.
JSESSIONID	session	Used by sites written in JSP. General purpose platform session cookies that are used to maintain users' state across page requests.
PHPSESSID	session	This cookie is native to PHP applications. The cookie is used to store and identify a users' unique session ID for the purpose of managing user session on the website. The cookie is a session cookies and is deleted when all the browser windows are closed.
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Cookie	Duration	Description
__atuvc	1 year 1 month	This cookie is set by Addthis to make sure you see the updated count if you share a page and return to it before our share count cache is updated.
__atuvs	30 minutes	This cookie is set by Addthis to make sure you see the updated count if you share a page and return to it before our share count cache is updated.
__cf_bm	30 minutes	This cookie is set by CloudFlare. The cookie is used to support Cloudflare Bot Management.
__pvi	1 day	This cookie is used for the implementation of the news content from other sites.
bcookie	2 years	This cookie is set by linkedIn. The purpose of the cookie is to enable LinkedIn functionalities on the page.
lidc	1 day	This cookie is set by LinkedIn and used for routing.

Cookie	Duration	Description
__gads	1 year 24 days	This cookie is set by Google and stored under the name dounleclick.com. This cookie is used to track how many times users see a particular advert which helps in measuring the success of the campaign and calculate the revenue generated by the campaign. These cookies can only be read from the domain that it is set on so it will not track any data while browsing through another sites.
_ga	2 years	This cookie is installed by Google Analytics. The cookie is used to calculate visitor, session, campaign data and keep track of site usage for the site's analytics report. The cookies store information anonymously and assign a randomly generated number to identify unique visitors.
_ga_Y7XD5FHQVG	2 years	This cookie is installed by Google Analytics.
_gat_UA-10686674-1	1 minute	This is a pattern type cookie set by Google Analytics, where the pattern element on the name contains the unique identity number of the account or website it relates to. It appears to be a variation of the _gat cookie which is used to limit the amount of data recorded by Google on high traffic volume websites.
_gid	1 day	This cookie is installed by Google Analytics. The cookie is used to store information of how visitors use a website and helps in creating an analytics report of how the website is doing. The data collected including the number visitors, the source where they have come from, and the pages visted in an anonymous form.
ajs_anonymous_id	never	This cookie is set by Segment.io to check the number of ew and returning visitors to the website.
ajs_user_id	never	The cookie is set by Segment.io and is used to analyze how you use the website
ANON_ID	3 months	This cookie is provided by Tribalfusion. The cookie is used to give a unique number to visitors, and collects data on user behaviour like what page have been visited. This cookie also helps to understand which sale has been generated by as a result of the advertisement served by third party.
jam_heavy_ga_session	5 years	This cookie is installed by Google Analytics.
UserID1	3 months	The cookie sets a unique anonymous ID for a website visitor. This ID is used to continue to identify users across different sessions and track their activities on the website. The data collected is used for analysis.
uvc	1 year 1 month	The cookie is set by addthis.com to determine the usage of Addthis.com service.

Cookie	Duration	Description
__tbc	2 years	This cookie is used for measuring the efficiency of advertisement by registering data on visitors from multiple website.
_cc_aud	8 months 26 days	The cookie is set by crwdcntrl.net. The purpose of the cookie is to collect statistical information in an anonymous form about the visitors of the website. The data collected include number of visits, average time spent on the website, and the what pages have been loaded. These data are then used to segment audiences based on the geographical location, demographic, and user interest provide relevant content and for advertisers for targeted advertising.
_cc_cc	session	The cookie is set by crwdcntrl.net. The purpose of the cookie is to collect statistical information in an anonymous form about the visitors of the website. The data collected include number of visits, average time spent on the website, and the what pages have been loaded. These data are then used to segment audiences based on the geographical location, demographic, and user interest provide relevant content and for advertisers for targeted advertising.
_cc_dc	8 months 26 days	The cookie is set by crwdcntrl.net. The purpose of the cookie is to collect statistical information in an anonymous form about the visitors of the website. The data collected include number of visits, average time spent on the website, and the what pages have been loaded. These data are then used to segment audiences based on the geographical location, demographic, and user interest provide relevant content and for advertisers for targeted advertising.
_cc_id	8 months 26 days	The cookie is set by crwdcntrl.net. The purpose of the cookie is to collect statistical information in an anonymous form about the visitors of the website. The data collected include number of visits, average time spent on the website, and the what pages have been loaded. These data are then used to segment audiences based on the geographical location, demographic, and user interest provide relevant content and for advertisers for targeted advertising.
_kuid_	5 months 27 days	The cookie is set by Krux Digital under the domain krxd.net. The cookie stores a unique ID to identify a returning user for the purpose of targeted advertising.
_rxuuid	1 year	The main purpose of this cookie is targeting, advertesing and effective marketing. This cookie is used to set a unique ID to the visitors, which allow third party advertisers to target the visitors with relevant advertisement up to 1 year.
ANON_ID_old	3 months	This cookie helps to categorise the users interest and to create profiles in terms of resales of targeted marketing. This cookie is used to collect user information such as what pages have been viewed on the website for creating profiles.
bscookie	2 years	This cookie is a browser ID cookie set by Linked share Buttons and ad tags.
CMID	1 year	The cookie is set by CasaleMedia. The cookie is used to collect information about the usage behavior for targeted advertising.
CMPRO	3 months	This cookie is set by Casalemedia and is used for targeted advertisement purposes.
CMPS	3 months	This cookie is set by Casalemedia and is used for targeted advertisement purposes.
CMST	1 day	The cookie is set by CasaleMedia. The cookie is used to collect information about the usage behavior for targeted advertising.
DSID	1 hour	This cookie is setup by doubleclick.net. This cookie is used by Google to make advertising more engaging to users and are stored under doubleclick.net. It contains an encrypted unique ID.
google_push	5 minutes	This cookie is set by the Bidswitch. This cookie is used to collect statistical data related to the user website visit such as the number of visits, average time spent on the website and what pages have been loaded. This collected information is used to sort out the users based on demographics and geographical locations inorder to serve them with relevant online advertising.
i	1 year	The purpose of the cookie is not known yet.
id	3 months	The main purpose of this cookie is targeting and advertising. It is used to create a profile of the user's interest and to show relevant ads on their site. This Cookie is set by DoubleClick which is owned by Google.
IDE	1 year 24 days	Used by Google DoubleClick and stores information about how the user uses the website and any other advertisement before visiting the website. This is used to present users with ads that are relevant to them according to the user profile.
IDSYNC	1 year	This cookie is used for advertising purposes.
KADUSERCOOKIE	3 months	The cookie is set by pubmatic.com for identifying the visitors' website or device from which they visit PubMatic's partners' website.
KTPCACOOKIE	1 day	This cookie is set by pubmatic.com for the purpose of checking if third-party cookies are enabled on the user's website.
ljt_reader	1 year	This is a Lijit Advertising Platform cookie. The cookie is used for recognizing the browser or device when users return to their site or one of their partner's site.
loc	1 year 1 month	This cookie is set by Addthis. This is a geolocation cookie to understand where the users sharing the information are located.
mc	1 year 1 month	This cookie is associated with Quantserve to track anonymously how a user interact with the website.
mt_mop	1 month	Stores information about how the user uses the website such as what pages have been loaded and any other advertisement before visiting the website for the purpose of targeted advertisements.
personalization_id	2 years	This cookie is set by twitter.com. It is used integrate the sharing features of this social media. It also stores information about how the user uses the website for tracking and targeting.
suid_legacy	1 year	This cookie is used to collect information on user preference and interactioin with the website campaign content. This cookie is used for promoting events and products by the webiste owners on CRM-campaign-platform.
TDCPM	1 year	The cookie is set by CloudFlare service to store a unique ID to identify a returning users device which then is used for targeted advertising.
TDID	1 year	The cookie is set by CloudFlare service to store a unique ID to identify a returning users device which then is used for targeted advertising.
test_cookie	15 minutes	This cookie is set by doubleclick.net. The purpose of the cookie is to determine if the user's browser supports cookies.
tluid	3 months	This cookie is set by the provider AdRoll.This cookie is used to identify the visitor and to serve them with relevant ads by collecting user behaviour from multiple websites.
tuuid	1 year	This cookie is set by .bidswitch.net. The cookies stores a unique ID for the purpose of the determining what adverts the users have seen if you have visited any of the advertisers website. The information is used for determining when and how often users will see a certain banner.
tuuid_lu	1 year	This cookie is set by .bidswitch.net. The cookies stores a unique ID for the purpose of the determining what adverts the users have seen if you have visited any of the advertisers website. The information is used for determining when and how often users will see a certain banner.
uid	5 months 27 days	This cookie is used to measure the number and behavior of the visitors to the website anonymously. The data includes the number of visits, average duration of the visit on the website, pages visited, etc. for the purpose of better understanding user preferences for targeted advertisments.
uuid	1 year 27 days	To optimize ad relevance by collecting visitor data from multiple websites such as what pages have been loaded.
VISITOR_INFO1_LIVE	5 months 27 days	This cookie is set by Youtube. Used to track the information of the embedded YouTube videos on a website.
wfivefivec	1 year 1 month	The domain of this cookie is owned by Dataxu. The main business activity of this cookie is targeting and advertising. This cookie tracks the advertisement report which helps us to improve the marketing activity.
xbc	2 years	This cookie is used for optmizing the advertisement on the website more relevant by analysing the user behaviour and interaction with the website.
YSC	session	This cookies is set by Youtube and is used to track the views of embedded videos.

Cookie	Duration	Description
__browsiSessionID	30 minutes	No description available.
__browsiUID	1 year	No description available.
__cflb	23 hours	This cookie is used by Cloudflare for load balancing.
__gpi	1 year 24 days	No description
ajs_group_id	never	This cookie is set by Segment.io. The purpose of the cookie is currently not identified.
blkbs	6 days 23 hours	No description
charitable_session	1 day	No description available.
cookietest	session	No description
debug	never	No description available.
gCStest	7 years 1 month 26 days 16 hours	No description
muc_ads	2 years	No description
revengine_browser_id	session	No description
revengine-browser-token	session	RevEngine Data Tool.
rl_user_id	never	No description available.
tf_respondent_cc	6 months	No description
UserMatchHistory	1 month	Linkedin - Used to track visitors on multiple websites, in order to present relevant advertisement based on the visitor's preferences.
vic_loc_error	10 minutes	No description
vicinity_id	1 year 10 months 24 days 11 hours	Vicinity Advertising.

Defend Truth

Opinionista

A culture of ethical AI research can counter dangerous algorithms designed to deceive

Is there a context in which algorithms could be designed to deceive, and what are the ethics of this? Where do we draw a line between a good lie and a bad lie, and what are the ethics of a good lie?

Countering damaging deception

Understanding behaviours

Comments - Please login in order to comment.

Top Reads This Hour

Dr Yakub Essack — Gift of the Givers medical team leader leaves a legacy of unmatched kindness

Following the path from city to country living

AirFryday: Glazed carrots in your air fryer, with a rosemary trick

ANC National Disciplinary Committee summons Zuma over MK party support

The looting of Prasa and the terrible price SA has been forced to pay

TOP READS IN SECTION

Dr Yakub Essack — Gift of the Givers medical team leader leaves a legacy of unmatched kindness

Leaked audio exposes ANC election plan for government PR events to showcase successes

Nearly R5-million cash, luxury watches seized as 28s gang boss accused Ralph Stanfield’s brother arrested

ANC National Disciplinary Committee summons Zuma over MK party support

Following the path from city to country living

SPONSORED CONTENT

Bravery in bytes, pioneering the future of girls in ICT

Capitalising on global opportunities: The case for South Africans investing in global income assets

Global Tech and Healthcare stocks to buoy equity returns in 2024

Tax-free Saving is a gift from the government – Maximise your time frame!

SA’s water shortages – recovering, recapturing and reusing as strategies.

Investigations

Investigations

News & Analysis

News & Analysis

Features

Features

Newsletters

Newsletters

Community

Community

DM168

DM168

Gauteng! Brace yourselves for The Premier Debate!

This could have been a paywall

On another site this would have been a paywall. Maverick Insider keeps our content free for all.

Defend Truth

Opinionista

A culture of ethical AI research can counter dangerous algorithms designed to deceive

Is there a context in which algorithms could be designed to deceive, and what are the ethics of this? Where do we draw a line between a good lie and a bad lie, and what are the ethics of a good lie?

Countering damaging deception

Understanding behaviours

Comments - Please login in order to comment.

Top Reads This Hour

Dr Yakub Essack — Gift of the Givers medical team leader leaves a legacy of unmatched kindness

Following the path from city to country living

AirFryday: Glazed carrots in your air fryer, with a rosemary trick

ANC National Disciplinary Committee summons Zuma over MK party support

The looting of Prasa and the terrible price SA has been forced to pay

TOP READS IN SECTION

Dr Yakub Essack — Gift of the Givers medical team leader leaves a legacy of unmatched kindness

Leaked audio exposes ANC election plan for government PR events to showcase successes

Nearly R5-million cash, luxury watches seized as 28s gang boss accused Ralph Stanfield’s brother arrested

ANC National Disciplinary Committee summons Zuma over MK party support

Following the path from city to country living

SPONSORED CONTENT

Bravery in bytes, pioneering the future of girls in ICT

Capitalising on global opportunities: The case for South Africans investing in global income assets

Global Tech and Healthcare stocks to buoy equity returns in 2024

Tax-free Saving is a gift from the government – Maximise your time frame!

SA’s water shortages – recovering, recapturing and reusing as strategies.

Investigations

Investigations

News & Analysis

News & Analysis

Features

Features

Newsletters

Newsletters

Community

Community

DM168

DM168

Please peer review 3 community comments before your comment can be posted

Gauteng! Brace yourselves for The Premier Debate!

This could have been a paywall

On another site this would have been a paywall. Maverick Insider keeps our content free for all.