
Opinionista

The lid has been lifted on the Pandora’s box of text-to-image AIs and will never be shut again


Tristan Marot is an Associate at law firm Norton Rose Fulbright South Africa Inc.

Concerns raised by artists about text-to-image artificial intelligence are reminiscent of the fears of the Luddites, who opposed the use of mechanical looms out of fear for their livelihoods. Like mechanical looms, text-to-image AIs are here to stay and will continue to progress and improve in ability.

In the late 18th and early 19th centuries, as the First Industrial Revolution began dramatically increasing the production of items such as cotton clothing, a movement of textile workers and weavers arose in protest against the new technology and machinery that they believed was threatening their jobs, lowering wages, increasing unemployment and reducing the quality of products.

The Luddites, as they came to be known, were so named after Ned Ludd, a mythical figure who was said to have destroyed two stocking frames in 1779 in protest against the mechanical knitting machines that were being introduced at the time.

The emergent Luddite movement responded to rapid changes that technology brought to the economy and society through staged protests and attacks on factories and machinery, sometimes destroying the equipment that was seen as a threat to livelihoods.

The Luddite movement eventually declined in the mid-1820s: it proved impossible to reverse the course of innovation, and the economic imperative to adopt these technologies, and their benefit to society as a whole, drastically outweighed the concerns of resistant workers.

Imagine if we had stalled the development of steam power to protect those textile workers; the consequential impact on modern society, trade, standards of living and science is unfathomable.

The term “Luddite” has continued to be used to refer to people who are opposed to technological innovation or who are resistant to the systemic change it causes. In the years since, we have seen many similar movements arise whenever innovation makes prior systems of labour or revenue generation obsolete. Contemporary examples include the opposition of metered taxi operators to Uber, or of coal truck drivers to renewable power generation.

Enter the rise of artificial intelligence (AI). In the last few months, opposition to AI on the basis that it makes existing means of work redundant has entered mainstream discourse, in part due to the meteoric rise in popularity of text-to-image AI. These AIs can turn words into drawings, or create fantastical images of oneself in exotic locales, all within seconds.

Very quickly, an outcry from digital artists arose, centred on two points: that the AI plagiarises the work of human artists and that people should rather commission actual human artists than pay for an AI app.

The arguments surrounding plagiarism focus on how the AI is trained and how it generates output images, so it is useful to first establish how this occurs. This is an incredibly complex and advanced area of computer programming, but to explain it as simply as possible, text-to-image AIs can be understood through generative adversarial networks (GANs), one of the main families of image-generating models on which such systems have been built.

Text-to-image GANs are a type of machine learning model that can generate images based on a given text description. They are made up of two neural networks — simply think of them as two separate computer programs which interact with each other: a generator that creates the images and a discriminator that tries to distinguish real images from the generated ones.

To train a text-to-image GAN, you need a dataset of images and corresponding text descriptions. The training process involves using the generator to create images based on the text descriptions and then presenting both the generated and real images to the discriminator. The discriminator tries to identify which images are real and which are fake, while the generator tries to create images that can fool the discriminator.

As the training process continues, the generator gets better at creating realistic images and the discriminator gets better at distinguishing real from fake images. This process is repeated until the generator is able to create images that are almost indistinguishable from real ones.
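For readers curious about what this back-and-forth looks like in practice, below is a minimal, heavily simplified sketch of an adversarial training loop, written in Python using the PyTorch library. The tiny networks, the randomly generated stand-in “dataset” and the absence of any text conditioning are simplifying assumptions made purely for illustration; real text-to-image systems are vastly larger and more sophisticated.

```python
# A minimal sketch (not a production model) of the adversarial training loop
# described above. The network sizes, image size and random stand-in "dataset"
# are illustrative assumptions, not the design of any real text-to-image system.
import torch
import torch.nn as nn

IMAGE_DIM = 28 * 28   # a tiny greyscale "image", flattened into a vector
NOISE_DIM = 64        # the random noise the generator starts from

# Generator: turns random noise into an image-shaped output.
generator = nn.Sequential(
    nn.Linear(NOISE_DIM, 256), nn.ReLU(),
    nn.Linear(256, IMAGE_DIM), nn.Tanh(),
)

# Discriminator: scores how likely an image is to be real.
discriminator = nn.Sequential(
    nn.Linear(IMAGE_DIM, 256), nn.LeakyReLU(0.2),
    nn.Linear(256, 1),
)

loss_fn = nn.BCEWithLogitsLoss()
g_opt = torch.optim.Adam(generator.parameters(), lr=2e-4)
d_opt = torch.optim.Adam(discriminator.parameters(), lr=2e-4)

# Stand-in for a real dataset of images paired with text descriptions.
real_images = torch.rand(512, IMAGE_DIM) * 2 - 1

for step in range(1000):
    batch = real_images[torch.randint(0, len(real_images), (32,))]
    noise = torch.randn(32, NOISE_DIM)
    fake = generator(noise)

    # 1. Train the discriminator to tell real images from generated ones.
    d_opt.zero_grad()
    d_loss = loss_fn(discriminator(batch), torch.ones(32, 1)) + \
             loss_fn(discriminator(fake.detach()), torch.zeros(32, 1))
    d_loss.backward()
    d_opt.step()

    # 2. Train the generator to produce images that fool the discriminator.
    g_opt.zero_grad()
    g_loss = loss_fn(discriminator(fake), torch.ones(32, 1))
    g_loss.backward()
    g_opt.step()
```

The two numbered steps inside the loop correspond directly to the process described above: the discriminator learns to spot the fakes, and the generator learns to produce images that are ever harder to spot.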

An argument which has been advanced against these AIs is that they are simply splicing together existing images using the text descriptions, like some sort of advanced Photoshop. This is untrue.

The starting point for the generator with every image is random noise, basically an image which would look like static from an untuned CRT TV. From there it attempts to find patterns in the noise which resemble the patterns it has associated with a given text description. But how does it determine what those patterns are?

To provide a very simplified example, let us say we ask it to draw a cat. To do so, we take an image of a cat (along with many more real images of cats) and provide it to the discriminator model, effectively telling it: these are cats; now determine whether the images coming from the generator are cats or not.

The generator then submits its random noise, which the discriminator easily identifies as fake. The generator changes a few parts of the image and submits it again. This is repeated thousands of times until the generator has, through trial and error, refined how it creates cats from the random noise and can fool the discriminator into believing its images really are of cats.

This, I would argue, is no different from how a human artist learns to create art: over years in childhood, learning what a cat looks like by being shown images of (or real-life) cats and being told these are cats; attempting, from that, to create depictions of cats; and, over time, refining the ability to create, from nothing, a depiction of a cat in a chosen medium.

Further, this often involves referring to reference works, depictions of cats by others, to see how they drew, painted or sculpted the cat. Artistic creation is fundamentally an iterative process, iterating both on one’s own work and on the work of other artists.

The argument for plagiarism in AI art which has gained the most traction comes from the provision of existing reference works to the discriminator. As described above, we need to tell the discriminator what a cat is by providing existing images of a cat taken/created by a human.




Many of these artists, so the argument goes, did not consent to their art being used to help train these AI, to which it must be asked, is their consent required?

Remember, the existing works are not being used in any way except to help the AI learn what the text description looks like in image form. They are not being spliced together or reproduced, but are instead used to build the conceptual framework with which the generator interprets patterns in the random noise to align with the text prompt.

This can range from the vague, such as knowing that many artistic images have a signature in the bottom left-hand corner (which AIs often try to replicate, adding their own signature of sorts to the artwork, as that is what convention dictates for those styles of artwork), to the specific, such as the artistic style of a single prolific artist.

This, I would argue, is no different from an art student studying the works of Rembrandt to gain an understanding of his use of light and shadow, or Picasso for his use of colour and form to convey meaning and emotion, and then replicating those features in their own artwork or using it to better inform their own work. Neither of these artists gave consent for their artwork to be used in this way, nor was it needed.

The reality is that once an artist makes and displays their artwork, it is out there for the world to study and critique. Whether this is done by a human artist or a computer one is, in my mind, irrelevant. And to dispense with the commercialisation argument: human artists who study others’ work are equally free to go on and sell their own.

Fundamentally, what informs the output of the AI is not really the images that were fed to it, but the text prompt, which guides the patterns into which the generator shapes the random noise.

Hence, if I ask it to create The Scream by Edvard Munch, it would give me an image which looks strikingly similar, but not identical, to the original artwork because it is pulling from those patterns. If I asked an art student to do the same, I would also get an image (depending on the skill level of the student) which would also be strikingly similar, but not exactly the same as the original.

Here the potential plagiarism is not from the ability to create the artwork, but from the prompt given and the intention of the use thereof.

The important distinction which needs to be made is that the tool itself does not plagiarise, but it can be used for plagiarism. The personification of AI means that the above distinction is often missed.

The broader conversation to be had here, then, is not about the tool plagiarising, but about users using it to plagiarise, and about how we can gain the benefits of the technology while limiting nefarious uses of it. This is a conversation relevant to all technology, but it very rarely ends in a decision not to use the technology at all.

The second critique, that people should rather commission a human artist, speaks more directly to the real concern at hand. That concern is rooted in artists’ fear that their livelihoods will no longer be viable, that their skills will no longer be of value or, more profoundly, that art itself will lose its meaning or value because it is no longer human-created.

I can sympathise with the fear. As a lawyer, I am already seeing tools emerge that are dramatically altering our workflow, and I can easily envisage a near future in which clients ask an AI, rather than me, how to address their legal issues. But as with the Luddites, this fear is not in itself justification for abandoning promising technologies.

It is also important to note that new technologies often create new job opportunities and industries that didn’t exist before. Text-to-image AI tools will be able to augment rather than replace human art practices. They may allow artists to work more efficiently, freeing up time for them to focus on more creative and complex tasks.

We are already seeing this manifest in the VFX industry, where better-quality shots are being produced with fewer resources. This has lowered the barrier to entry for independent artists to produce outstanding work, which has in turn dramatically increased the amount of high-quality visual media being created and shared.

As for the concern that art itself will lose its value or meaning if it is no longer solely created by humans, one must recognise that art has always evolved alongside technological advances. From the invention of the printing press to the rise of photography, art has always found new ways to incorporate and respond to new technologies. AI text-to-image tools are simply the latest example of this.

While they may challenge our traditional notions of what constitutes “authentic” art, they also offer new opportunities for artists to push the boundaries of creative expression. Ultimately, the value of art lies in its ability to evoke emotion and inspire thought, regardless of the tools used to create it.

In conclusion, the concerns raised by artists about the use of text-to-image AIs are reminiscent of the fears of the Luddites, who opposed the use of mechanical looms out of fear for their livelihoods.

Just as with the mechanical looms, text-to-image AIs are here to stay and will continue to progress and improve in ability. Like Pandora’s box, the technology has been opened and cannot be shut again.

Text-to-image AIs do not plagiarise existing works and have the potential to augment rather than replace human artists. While it’s true that new technologies can sometimes lead to economic disruption, they also create new opportunities and industries that didn’t exist before.

Rather than lamenting their existence or trying to stop their progression, it’s crucial that we embrace these tools and have a productive discourse about how they can be used to enhance the art world and expand the boundaries of creative expression. DM
