Deep Learning With The Wolf

Perplexity AI Under Fire for Unethical Practices
Perplexity AI, a startup backed by investors including Jeff Bezos' family fund and Nvidia, is under scrutiny for allegedly scraping content without permission and fabricating information.

Perplexity's CEO describes the product as an "answer engine," but investigations have found that it violates the Robots Exclusion Protocol by scraping content it has been told not to access. It often paraphrases or inaccurately summarizes articles without proper attribution, and sometimes it fabricates details entirely.

I used to fact-check my articles with Perplexity, but I stopped when it began checking them against outdated articles or returning inaccurate results. I now start my research in ChatGPT and then verify every link to make sure it is real. If I want a second opinion on the quality of my writing, I ask Claude. I haven't used Perplexity in over two months.

Forbes and WIRED, among others, have accused Perplexity of plagiarism. The chatbot sometimes generates content from the traces an article leaves around the web rather than from the article itself. Perplexity's reported use of unpublicized IP addresses to reach content that blocks its crawler raises further ethical and legal concerns.

The Robots Exclusion Protocol

The Robots Exclusion Protocol (REP) is a standard that websites use to tell web crawlers which parts of a site they may not access. It relies on a robots.txt file placed at the root of the website, which lists rules about which pages web robots should not crawl. Site owners use it, for example, to keep sensitive information or pages they do not want appearing in search engine results away from crawlers. The protocol matters for user privacy and data security, but it is advisory: it only works if bots voluntarily respect the directives. Investigations have found that Perplexity AI ignores these directives, leading to unauthorized scraping and the ethical violations described above. A short illustration of how a compliant crawler honors robots.txt appears below the image caption.

A robot attempts to use an exclusion protocol to block web crawlers. #DALL-E for #DeepLearningDaily.
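To make the mechanism concrete, here is a minimal sketch of how a compliant crawler consults robots.txt before fetching a page, using Python's standard-library urllib.robotparser. The rules, the "ExampleBot" user agent, and the example.com URLs are illustrative stand-ins, not drawn from any real site or from Perplexity's code.

# A minimal sketch of a compliant crawler's robots.txt check.
# Everything here (rules, user agents, URLs) is illustrative only.
from urllib.robotparser import RobotFileParser

SAMPLE_ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
Disallow: /drafts/

User-agent: ExampleBot
Disallow: /
"""

parser = RobotFileParser()
parser.parse(SAMPLE_ROBOTS_TXT.splitlines())

# A well-behaved crawler asks before every fetch; skipping this check is
# exactly what the investigations accuse Perplexity's crawler of doing.
for agent, url in [
    ("*", "https://example.com/articles/some-story"),
    ("*", "https://example.com/private/data"),
    ("ExampleBot", "https://example.com/articles/some-story"),
]:
    print(agent, url, "allowed =", parser.can_fetch(agent, url))

The key point is that nothing enforces this check: robots.txt is a voluntary convention, so a crawler that skips it can still reach the restricted pages.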

Challenges and Future Outlook

The ethical implications of AI systems scraping and fabricating content are significant. Ensuring accurate, attributed content is vital for AI credibility. The Perplexity case highlights the need for stricter regulations and guidelines in AI development to protect intellectual property and maintain ethical standards.

The accusations against Perplexity have several implications:

  • Trustworthiness of AI: If AI tools like Perplexity can't be trusted to provide accurate and ethically sourced information, their utility and credibility are compromised.

  • Legal Risks: Violating the Robots Exclusion Protocol and scraping content without permission could lead to legal consequences, including lawsuits from content creators and publishers.

  • Impact on Content Creators: Unauthorized use of content can harm the revenue and reputation of original content creators. It undermines the work of journalists, artists, and other creators by repurposing their content without proper credit or compensation.

  • Regulatory Scrutiny: Such cases may prompt stricter regulations and oversight on AI and tech companies to ensure compliance with ethical standards and respect for intellectual property.

Final Thoughts

The Perplexity AI controversy underscores the importance of ethical practices in AI development. Companies must prioritize transparency, respect for intellectual property, and accuracy in content generation. As AI continues to evolve, establishing and adhering to ethical guidelines will be crucial in maintaining the technology's integrity and trustworthiness.


Crafted by Diana Wolf Torres: Merging human expertise with AI


Vocabulary Key:

  • Robots Exclusion Protocol: A standard used by websites to manage and block web crawlers from accessing certain parts of the site.

  • IP Address: A unique address that identifies a device on the internet or a local network.

  • Hallucination (in AI): When an AI generates information that is not based on factual data.


FAQs:

  • What is Perplexity AI accused of? Plagiarism, content scraping, and fabricating information.

  • How does Perplexity violate web standards? By ignoring the Robots Exclusion Protocol.

  • What have investigations revealed about Perplexity's practices? It accesses restricted content and inaccurately summarizes articles.

  • Who has accused Perplexity of plagiarism? Forbes, WIRED, among others.

  • Why is this issue important? It raises ethical and legal concerns about AI practices.


Additional Resources for Inquisitive Minds:

Robots Exclusion Protocol. Internet Engineering Task Force. (September 2022.)

The Prompt: Perplexity’s Plagiarism Problem. Forbes. (June 11, 2024.)

Perplexity Is a Bullsh*t Machine. WIRED. (June 19, 2024.)

Follow @DeepLearningDaily on YouTube.

Follow @DeeplearningwiththeWolf on Spotify. “Deep Learning with the Wolf” will cover the Perplexity AI controversy, breaking down the investigation findings and discussing the ethical implications in an accessible, engaging format. Join the Wolf Pack!
