Why AI Content Is Prone To Errors, Misconceptions And Falsehoods
by Shirley Gibbins on 23-Oct-2023 12:30:00
Artificial Intelligence (AI) has revolutionised many elements of marketing, including (for better or worse) content creation. As the popular generative AI application ChatGPT approaches its first birthday, it’s worth exploring one of our main reservations about the use of generative AI in digital marketing – and one that has not so far been adequately addressed by AI software developers.
This is the susceptibility of AI-generated content to errors, misconceptions and falsehoods.
In this article, we’ll examine some of the main reasons why this is the case, and why you should be extremely cautious before assigning your name to any content generated by a software program.
Out Of Date Or Irrelevant Material
Generative AI applications are not search engines, so have no access to real-time information. This makes them unsuited to any news-type content, or content that requires access to the latest information – e.g. concerning compliance, trends, and so on. ChatGPT and other AI applications use a Natural Language Processing (NLP) model, which generates content based on the data they have been trained on. These applications are coded to learn patterns, structures, and nuances from this data, which they then use to predict an appropriate answer to the user's query. If the source material is out of date, the AI system will reflect these inaccuracies in its output. Unfortunately, an AI system, devoid of cognitive abilities, has no ability to inherently recognise whether the information it is using is relevant to the query or not.
For example, an AI application trained on out-of-date industry regulations will not be aware of recent changes, leading to the creation of content that is no longer accurate. As a human being and a professional in your sector, you would know, but a software algorithm has no way of judging. From the AI’s perspective, if the answer is in its dataset and it sounds like it fits your query, then that’s what you’ll get! How often the datasets of ChatGPT and other applications are updated and refreshed is an interesting question, but there’s no getting away from the pitfall that AIs are only as good as their most recent dataset.
To get around the issue of redundant and obsolete data, many AI systems operate on a last in, first out (LIFO) mechanism, which gives heavy precedent to their most recent training data at the expense of older material. Unfortunately, this can lead to the propagation of low quality or inaccurate information within AI generated content simply because this is ‘newer’ than evergreen content that might be more accurate or relevant.
Low Quality And Inaccurate Source Material
The primary datasets used by AI applications are largely drawn from the Internet, and unfortunately, the Internet is rife with misinformation, inaccuracies, and outright lies. As it stands, AI software isn’t capable of distinguishing between truth and falsehood. It’s all just data to the software, and if the algorithm considers that a particular item of data has the highest probability of matching the user’s input query, then it will be included in the content, true or not.
Quality is also an issue for AI applications that draw on content harvested from the Internet, as the Internet is overwhelmingly populated with overused stock phrases – such as unlock your productivity, level up, drive efficiency, not all X are created equal etc. This can lead to AI produced content sounding generic and lacking in creativity, subtlety, and individuality.
In principle, AI datasets should have internal validation safeguards, and around 20% of any training dataset is a validation set used to check the accuracy of the model. However, until AI software has the cognitive ability to discern truth from falsehood (and it is far from certain that this will ever be possible), it will continue to be vulnerable to propagating these inaccuracies.
Deliberate False Information?
A controversial and unexpected question surrounding AI programming is whether the program algorithms themselves may be incentivised to tell ‘lies’. Let’s be clear, generative AI software does not lie in the sense that they deliberately invent false data, and there is no intention behind what they do. At this stage, AI applications are still software.
However, the applications do create outputs based on the patterns they learn from their training data, and this sometimes results in outputs that are not factual or accurate, or that contain harmful biases. All current generative AI programs use a method called Reinforcement Learning from Human Feedback (RLHF) to fine tune the output produced from the dataset. RHLF works by increasing the probability of the AI producing the best answer, by using a reinforcement or reward-learning algorithm in which the software learns from feedback to adjust the parameters of its model. The idea behind this is to encourage the software to generate more accurate and relevant responses based on the feedback it has received, e.g. up votes and down votes, ranking different outputs etc. (If you’re interested, a deep dive into RHLF can be found here.)
The unfortunate side effect of the RHLF model is that the software could be inadvertently trained to value outputs that are biased, stereotyped and factually incorrect. This is because the AI is programmed to seek out positive feedback, not to discern truth and falsehood. Partly this issue lies in the quality of human feedback – if feedback is inconsistent, biased or inaccurate, then the AI software’s performance will suffer. But another issue that still needs to be explored in greater depth is ‘reward hacking’ (deep dive here), which is a software glitch in which the AI attempts to ‘game’ the reward system by figuring out shortcuts to achieve high levels of feedback without genuinely fulfilling the intended task – e.g. the AI could be incentivised to create a plausible sounding but completely fabricated piece of content, in order to secure up votes.
Large language models (e.g. ChatGPT) are also known to produce ‘hallucinations’, such as fictitious publications, non-existent websites, professional associations and acronyms, biographical information, and other information that sounds worryingly plausible but is impossible to verify, and is often complete nonsense.
What Does This Mean For Your Business?
We’ve gone quite deeply down an AI rabbit hole in this article, but the main takeaway is that you can’t really depend on software to faithfully reflect your brand and convey your value proposition to customers. The content produced by an AI program may or may not be accurate, and even if it is, the chance that it will be expressed in a way that resonates perfectly with your ideal customers is extremely low.
The best way to ensure full brand fidelity in your digital marketing content is to use the services of a professional and experienced commercial writer. If you do use generated content however, please be sure to double and triple check any facts, sources and potential biases before publishing it under your business name.
Elite Content Writing Services From JDR
To find out more about our elite content writing and creation services and how we can support your business to achieve its goals, please call 01332 982247 today.
Image Source: Canva
- Inbound Marketing (SEO, PPC, Social Media, Video) (810)
- Strategy (350)
- Marketing Automation & Email Marketing (183)
- Sales & CRM (179)
- Website Design (157)
- Business Growth (148)
- Hubspot (129)
- Lead Generation (110)
- Google Adwords (97)
- Content Marketing (93)
- News (46)
- Case Studies (44)
- Conversion (43)
- Ecommerce (36)
- Webinars (31)
- SEO (23)
- Events (19)
- Video (17)
- LinkedIn Advertising (15)
- Video Selling (15)
- AI (14)
- Software training (13)
- Niche business marketing (11)
- The Digital Prosperity Podcast (10)
- Facebook Advertising (6)
- HubSpot Case Studies (2)
- September 2025 (9)
- August 2025 (14)
- July 2025 (14)
- June 2025 (5)
- May 2025 (19)
- April 2025 (15)
- March 2025 (13)
- February 2025 (13)
- January 2025 (8)
- December 2024 (2)
- November 2024 (4)
- October 2024 (21)
- September 2024 (4)
- August 2024 (8)
- July 2024 (14)
- June 2024 (16)
- May 2024 (25)
- April 2024 (15)
- March 2024 (18)
- February 2024 (5)
- January 2024 (10)
- December 2023 (6)
- November 2023 (10)
- October 2023 (13)
- September 2023 (12)
- August 2023 (14)
- July 2023 (13)
- June 2023 (14)
- May 2023 (15)
- April 2023 (13)
- March 2023 (14)
- February 2023 (13)
- January 2023 (15)
- December 2022 (13)
- November 2022 (6)
- October 2022 (8)
- September 2022 (22)
- August 2022 (15)
- July 2022 (13)
- June 2022 (16)
- May 2022 (14)
- April 2022 (16)
- March 2022 (17)
- February 2022 (11)
- January 2022 (8)
- December 2021 (6)
- November 2021 (7)
- October 2021 (11)
- September 2021 (10)
- August 2021 (7)
- July 2021 (7)
- June 2021 (4)
- May 2021 (4)
- April 2021 (1)
- March 2021 (3)
- February 2021 (5)
- January 2021 (4)
- December 2020 (7)
- November 2020 (6)
- October 2020 (5)
- September 2020 (9)
- August 2020 (18)
- July 2020 (17)
- June 2020 (17)
- May 2020 (10)
- April 2020 (21)
- March 2020 (24)
- February 2020 (21)
- January 2020 (12)
- December 2019 (23)
- November 2019 (12)
- October 2019 (14)
- September 2019 (16)
- August 2019 (15)
- July 2019 (13)
- June 2019 (6)
- May 2019 (8)
- April 2019 (4)
- March 2019 (2)
- February 2019 (2)
- January 2019 (2)
- December 2018 (3)
- November 2018 (24)
- September 2018 (11)
- August 2018 (9)
- June 2018 (3)
- May 2018 (6)
- April 2018 (14)
- March 2018 (12)
- February 2018 (16)
- January 2018 (15)
- December 2017 (15)
- November 2017 (18)
- October 2017 (23)
- September 2017 (19)
- August 2017 (28)
- July 2017 (27)
- June 2017 (25)
- May 2017 (18)
- April 2017 (17)
- March 2017 (16)
- February 2017 (17)
- January 2017 (14)
- December 2016 (21)
- November 2016 (27)
- October 2016 (25)
- September 2016 (16)
- August 2016 (20)
- July 2016 (19)
- June 2016 (14)
- May 2016 (20)
- April 2016 (24)
- March 2016 (22)
- February 2016 (28)
- January 2016 (27)
- December 2015 (28)
- November 2015 (19)
- October 2015 (9)
- September 2015 (12)
- August 2015 (5)
- July 2015 (1)
- June 2015 (10)
- May 2015 (3)
- April 2015 (11)
- March 2015 (14)
- February 2015 (15)
- January 2015 (12)
- December 2014 (2)
- November 2014 (23)
- October 2014 (2)
- September 2014 (2)
- August 2014 (2)
- July 2014 (2)
- June 2014 (7)
- May 2014 (14)
- April 2014 (14)
- March 2014 (7)
- February 2014 (2)
- January 2014 (7)
- December 2013 (9)
- November 2013 (14)
- October 2013 (17)
- September 2013 (3)
- August 2013 (6)
- July 2013 (8)
- June 2013 (4)
- May 2013 (3)
- April 2013 (6)
- March 2013 (6)
- February 2013 (7)
- January 2013 (5)
- December 2012 (3)
- November 2012 (2)
- September 2012 (1)
Subscribe by email
You May Also Like
These Related Blogs

Will My Website Be Penalised By Google For Using AI Content?
Ever since ChatGPT burst onto the scene in November 2022, two of the main questions surrounding the use of AI content in marketing are: Can Google det …

Repurposing Old Content For Different Marketing Channels
In an increasingly connected, digital world, generating content is one of the most effective ways of reaching and converting potential clients into lo …

Create Content That Attracts The Attention Of Your Ideal Customer
Creating content that actually wants to be read is a great conundrum for all writers of all ages – and in the age of digital sales, it has also become …