Training». https://arxiv.org/pdf/2104.10350.pdf.
31. «Training language models to follow instructions with human feedback». https://arxiv.org/pdf/2203.02155.pdf.
32. «Aligning language models to follow instructions». https://openai.com/research/instruction-following.
33. «How Large Language Models Will Transform Science, Society, and AI». Stanford HAI. https://hai.stanford.edu/news/how-large-language-models-will-transform-science-society-and-ai.
34. R. Brandl, «ChatGPT Statistics 2023All the latest statistics about OpenAI’s chatbot». Tooltester, Feb. 15, 2023. https://www.tooltester.com/en/blog/chatgpt-statistics/.
35. «Lessons learned on language model safety and misuse». https://openai.com/research/language-model-safety-and-misuse.
36. C. Metz, «The ChatGPT King Isn’t Worried, but He Knows You Might Be». The New York Times, The New York Times, Mar. 31, 2023. (Онлайн). https://www.nytimes.com/2023/03/31/technology/sam-altman-open-ai-chatgpt.html.
37. K. Hu, «ChatGPT sets record for fastest-growing user base – analyst note». Reuters, Reuters, Feb. 02, 2023. (Онлайн). https://www.reuters.com/technology/chatgpt-sets-record-fastest-growing-user-base-analyst-note‐2023-02-01/.
38. «Towards a Human-like Open-Domain Chatbot». https://arxiv.org/pdf/2001.09977.pdf.
39. «LaMDA: Towards Safe, Grounded, and High-Quality Dialog Models for Everything». https://ai.googleblog.com/2022/01/lamda-towards-safe-grounded-and-high.html.
40. B. Lemoine, «Is LaMDA Sentient? – an Interview». Medium, Jun. 11, 2022. https://cajundiscordian.medium.com/is-lamda-sentient-an-interview-ea64d916d917.
41. R. Luscombe, «Google engineer put on leave after saying AI chatbot has become sentient». The Guardian, The Guardian, Jun. 12, 2022. (Онлайн). https://www.theguardian.com/technology/2022/jun/12/google-engineer-ai-bot-sentient-blake-lemoine.
42. S. Pichai, «An important next step on our AI journey». Google, Feb. 06, 2023. https://blog.google/technology/ai/bard-google-ai-search-updates/.
43. N. Grant and C. Metz, «A New Chat Bot Is a „Code Red“ for Google’s Search Business». The New York Times, The New York Times, Dec. 21, 2022. (Онлайн). https://www.nytimes.com/2022/12/21/technology/ai-chatgpt-google-search.html.
44. «Google’s CEO Sundar Pichai on Bard, AI Whiplash, and Competing with ChatGPT». https://www.nytimes.com/2023/03/31/podcasts/hard-fork-sundar.html.
45. «Pathways Language Model (PaLM): Scaling to 540 Billion Parameters for Breakthrough Performance». https://ai.googleblog.com/2022/04/pathways-language-model-palm-scaling-to.html.
46. B. Allyn, «Microsoft’s new AI chatbot has been saying some „crazy and unhinged things“». WAMU 88.5 – American University Radio, Mar. 02, 2023. https://wamu.org/story/23/03/02/microsofts-new-ai-chatbot-has-been-saying-some-crazy-and-unhinged-things/.
47. K. Roose, «Bing’s A. I. Chat: „I Want to Be Alive“». The New York Times, The New York Times, Feb. 16, 2023. (Онлайн). https://www.nytimes.com/2023/02/16/technology/bing-chatbot-transcript.html.
48. M. C. Blogs, «Reinventing search with a new AI-powered Microsoft Bing and Edge, your copilot for the web». The Official Microsoft Blog, Feb. 07, 2023. https://blogs.microsoft.com/blog/2023/02/07/reinventing-search-with-a-new-ai-powered-microsoft-bing-and-edge-your-copilot-for-the-web/.
49. «Twitter: „Tay“ went from „humans are super cool“ to full nazi in <24 hrs and I’m not at all concerned about the future of AI». Twitter. https://twitter.com/geraldmellor/status/712880710328139776/photo/3.
50. «Making search conversational: Finding and chatting with bots on Bing». https://blogs.bing.com/search-quality-insights/2017–05/making-search-conversational-finding-and-chatting-with-bots-on-bing/.
51. T. Warren, «Microsoft has been secretly testing its Bing chatbot Sydney for years». The Verge, Feb. 23, 2023. https://www.theverge.com/ 2023/2/23/23609942/microsoft-bin g-sydney-chatbot-history-ai.
52. Facebook* company, «BlenderBot 3: An AI Chatbot That Improves Through Conversation». Meta [127], Aug. 05, 2022. https://about.fb.com/news/2022/08/blenderbot-ai-chatbot-improves-through-conversation/.
53. «Introducing LLaMA: A foundational, 65‐billion-parameter language model». https://ai.facebook.com/blog/large-language-model-llama-meta-ai/.
54. «LLaMA: Open and Efficient Foundation Language Models». https://arxiv.org/pdf/2302.13971.pdf.
55. A. Hern, «TechScape: Will Meta’s * massive leak democratise AI – and at what cost?». The Guardian, The Guardian, Mar. 07, 2023. (Онлайн). https://www.theguardian.com/technology/2023/mar/07/techscape-meta-leak-llama-chatgpt-ai-crossroads.
56. Facebook* company, «Meta [128] and Microsoft Introduce the Next Generation of Llama». Meta *, Jul. 18, 2023. https://about.fb.com/news/2023/07/llama‐2/.
2. Обучение больших языковых моделей
1. J. Wei et al., «Emergent Abilities of Large Language Models». Jun. 15, 2022. (Онлайн). http://arxiv.org/abs/2206.07682.
2. R. Schaeffer, B. Miranda, and S. Koyejo, «Are Emergent Abilities of Large Language Models a Mirage?». Apr. 28, 2023. (Онлайн). http://arxiv.org/abs/2304.15004.
3. «Language Models are Few-Shot Learners». https://arxiv.org/pdf/2005.14165.pdf.
4. «eDiscovery Best Practices: Perspective on the Amount of Data Contained in 1 Gigabyte». CloudNine, Mar. 05, 2012. https://cloudnine.com/ediscoverydaily/electronic-discovery/ediscovery-best-practices-perspective-on-the-amount-of-data-contained-in‐1‐gigabyte/.
5. «On the Dangers of Stochastic Parrots: Can Language Models Be Too Big?». ACM Digital Library. https://dl.acm.org/doi/pdf/10.1145/ 3442188.3445922.
6. A. Caliskan, J. J. Bryson, and A. Narayanan, «Semantics derived automatically from language corpora contain human-like biases». Science, vol. 356, no. 6334, pp. 183–186, Apr. 2017, doi: https://doi.org/10.1126/science.aal4230.
7. «Persistent Anti-Muslim Bias in Large Language Models». https://arxiv.org/pdf/2101.05783.pdf.
8. «Gender and Representation Bias in GPT‐3 Generated Stories». https://aclanthology.org/2021.nuse‐1.5.pdf.
9. «StereoSet: Measuring stereotypical bias in pretrained language models». https://aclanthology.org/2021.acl-long.416.pdf.
10. «Black Lives Matter in Wikipedia: Collaboration and Collective Memory around Online Social Movements». https://dl.acm.org/doi/pdf/10.1145/2998181.2998232.
11. «Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings». https://arxiv.org/pdf/1607.06520.pdf.
12. «An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models». https://arxiv.org/pdf/2110.08527.pdf.
13. «Hugging Face Dataset Cards». https://huggingface.co/docs/hub/datasets-cards.
14. «The ROOTS Search Tool: Data Transparency for LLMs». https://arxiv.org/pdf/2302.14035.pdf.
15. «Extracting Training Data from Large Language Models». https://arxiv.org/pdf/2012.07805.pdf.
16. «The Secret Sharer: Evaluating and Testing Unintended Memorization in Neural Networks». https://arxiv.org/pdf/1802.08232.pdf.
17. «Protecting privacy in practice: The current use, development and limits of Privacy Enhancing Technologies in data analysis». https://royalsociety.org/-/media/policy/projects/privacy-enhancing-technologies/Protecting-privacy-in-practice.pdf.
18. E. M. Renieris, Beyond Data: Reclaiming Human Rights at the Dawn of the Metaverse. MIT Press, 2023. (Онлайн). https://books.google.com/books/about/Beyond_Data.html?hl=&id=zJZuEAAAQBAJ.
3. Конфиденциальность и безопасность данных в аспекте LLM
1. «OpenAI Chatbot Spits Out Biased Musings, Despite Guardrails». Bloomberg. https://www.bloomberg.com/news/newsletters/2022-12-08/chatgpt-open-ai-s-chatbot-is-spitting-out-biased-sexist-results.
2. A. Askell et al., «A General Language Assistant as a Laboratory for Alignment». Dec. 01, 2021. (Онлайн). http://arxiv.org/abs/2112.00861.
3. H. Ngo et al., «Mitigating harm in language models with conditional-likelihood filtration». Aug. 04, 2021. (Онлайн). http://arxiv.org/abs/2108.07790.
4. T. Korbak et al., «Pretraining Language Models with Human Preferences». Feb. 16, 2023. (Онлайн). http://arxiv.org/abs/2302.08582.
5. P. Christiano, J. Leike, T. B. Brown, M. Martic, S. Legg, and D. Amodei, «Deep reinforcement learning from human preferences». Jun. 12, 2017. (Онлайн). http://arxiv.org/abs/1706.03741.
6. B. Perrigo, «Exclusive: OpenAI Used Kenyan Workers on Less Than $2 Per Hour to Make ChatGPT Less Toxic». Time, Jan. 18, 2023. https://time.com/6247678/openai-chatgpt-kenya-workers.
7. Y. Bai et al., «Constitutional AI: Harmlessness from AI Feedback». Dec. 15, 2022. (Онлайн). http://arxiv.org/abs/2212.08073.
8. P. «Claude’s Constitution». Anthropic, May 09, 2023. https://www.anthropic.com/index/claudes-constitution.
9. C. Xiang, «"He Would Still Be Here": Man Dies by Suicide After Talking with AI Chatbot, Widow Says». VICE, Mar. 30, 2023. https://www.vice.com/en/article/pkadgm/man-dies-by-suicide-after-talking-with-ai-chatbot-widow-says.
10. D. Kundaliya, «Microsoft staff can read Bing chatbot messages». Feb. 28, 2023. https://www.computing.co.uk/news/4076705/microsoft-staff-read-bing-chatbot-messages.
11. «What is ChatGPT?». https://help.openai.com/en/articles/6783457‐what-is-chatgpt.
12. «New ways to manage your data in ChatGPT». https://openai.com/blog/new-ways-to-manage-your-data-in-chatgpt.
13. «Bard FAQ». https://bard.google.com/faq.
14. «Manage & delete your Bard activity». https://support.google.com/bard/answer/13278892.
15. «OpenAI’s Privacy policy». https://openai.com/policies/privacy-policy.
16. E. Dreibelbis, «Samsung Software Engineers Busted for Pasting Proprietary Code Into ChatGPT». PCMag, Apr. 07, 2023. https://www.pcmag.com/news/samsung-software-engineers-busted-for-pasting-proprietary-code-into-chatgpt.
17. B. Wodecki, «JPMorgan Joins Other Companies in Banning ChatGPT». AI Business, Feb. 24, 2023. https://aibusiness.com/verticals/some-big-companies-banning-staff-use-of-chatgpt.
18. «March 2 °ChatGPT outage: Here’s what happened». https://openai.com/blog/march‐20‐chatgpt-outage.
19. «Provvedimento del 30 marzo 2023 [9870832]». https://web.archive.org/web/20230404210519/https://www.gpdp.it:443/web/guest/home/docweb/-/docweb-display/docweb/9870832.
20. «Data Protection Legislation in Sweden: A Statistician’s Perspective». https://www.jstor.org/stable/2982482.
21. «Records, Computers, and Rights of Citizens». https://www.justice.gov/opcl/docs/rec-com-rights.pdf.
22. «Data protection in the EU». European Commission. https://commission.europa.eu/law/law-topic/data-protection/data-protection-eu_en.
23. F. H. Cate, «The Failure of Fair Information Practice Principles». 2006, (Онлайн). https://papers.ssrn.com/abstract=1156972.
24. «California Consumer Privacy Act (CCPA)». State of California – Department of Justice – Office of the Attorney General, Oct. 15, 2018. https://oag.ca.gov/privacy/ccpa.
25. N. Confessore, «Cambridge Analytica and Facebook [129]: The Scandal and the Fallout So Far». The New York Times, Apr. 04,