Instruction
March 2 (Thu) 12:00-1:30 PM CSE376
Participants: Rachael Hong, Rachel McAmis, Liwei Jiang, Kaiming Cheng, Gregor Haas, Tina Yeung, Miranda Wei, Miro Enev, Yoshi Kohno, Inyoung Cheong
[ Primary goal is to get as many different things on the board as possible – optimize for breadth of use cases, stakeholders, datasets, impacts. Use “other ideas” section to record other ideas that are worth discussing later, e.g., ideas about how to build better systems ]
Overview
In this expert panel, we are envisioning the future of generative language models and their use cases. We want to understand both what good things might be in the future but, especially for our purposes, also what bad things might be in the future. When envisioning “bad things”, our goal is to consider situations in which there are no technical or policy mechanisms in place that might prevent, mitigate, or minimize those “bad things”. I.e., even if we do not think that a certain “bad thing” might happen, for any reason (policy or technical), let’s still envision them here.
When we say “generative language models”, we are thinking of large-scale systems that use an understanding of language to generate text (written or spoken or otherwise communicated). That text might be an answer to a question, an answer to a prompt, part of a communications exchange (a dialog), or more – this session is about envisioning those possible use cases, so do not restrict your thinking only to the types of systems we currently encounter.
To do at / before start of meeting
Create the following regions on the whiteboard / wall:
Impacts (areas for “good”, “bad”, “other”) Changes and Resulting Impacts Other ideas, defenses, project ideas, etc
Procedure:
Use cases and stakeholder brainstorming (different regions of whiteboard board / different color post-it notes) [5-10 minutes + discussion] [do not be constrained to current technologies] [stakeholders include users, non-users, companies, governments, M&V populations, other countries] Datasets + incentive structure and impacts brainstorming (different regions of whiteboard board / different color post-it notes) [5-10 minutes + discussion] [datasets means “inputs to training systems” and impacts means “what happens as a result”; impacts can be “good”, “bad”, “both good and bad”, “unclear”]
Results
Use case
Generating social media content / news Meeting notes & agenda making Summarizing articles or other content Reading/writing assistant Summarization (downstream tasks in NLP) Code generation in new domains (hardware? Other misc. Boilerplate? Data generation (reverse prompting?) Inputting problem statement & getting out an algorithm Inputting app spec and outputting app code Writing waivers or other legal documents Automated defense attorneys Making legal decisions based of court script Generating privacy policy, Terms of Services Business document drafting Given symptoms, output diagnosis (automated Web MD) Medical doctor understanding & analysis Advice: what should I do to get promoted? Tell me recipe given ingredients Professional assistant tools for doctors, CPAs, lawyers Menus, food descriptions, nutrition labels Generative tech used in other AI fields e.g., planning for debts Help science communication by translating things into lay language Generating research ideas Research to gauge the “average sentiment” of a topic/person, etc. Finding research methodology to use Writing background section or conclusion of research paper Give difficult Q+A after practice talk Machine generated art & literature Writing songs, movies, plays, etc. Prediction of non-word features (large music models, large image models?) Content recommendation → on-demand content generation Personalized content creation/curation Companionship systems (e.g., elderly who live alone to converse with) Talk with deceased relatives People who want chatbot friend Imaginary pets (Damagochi) Imaginary significant other Apps that mimics interacting with celebrity Emotional support/Therapy Personalized emotional support. Therapist? Sentiment: based on chats or emails from person X, what does person X think about me? Education e.g., language learning tools Children in school learning to {write} (Insert topic) Machine generated education curriculum Virtual school tutor/teacher Interactive QA system for kids to learn about reading & thinking Language translation to facilitate inter-cultural/regional communication Practicing something in another language Creating marketing language Customer interactive, compelling ads Recommendation systems maybe personalized ads Personalized virtual representation Personal assistant/planner Virtual assistants / chatbots Government benefit claims & complaints Accessibility (dyslexia, neurodivergence) Disinformation & astroturfing Automated spam/harassment Deep fake audio (phishing…) Interactive deep-fake/pornography Emotionally engaging content Cognitive science strong generating stimuli Police/immigration: Interrogation assistant Government uses - summarize spy messages, search for secret information Government summarize intercepted calls Firewall monitoring (regex → LLM) packet introspection Circumvent your “textual footprint” by generic LLM language instead Child protective services interview assistants Improve training (if there is a speech synthesizer) Online content moderation App for “Ferris Bueller” (Help trick parents about locations) Parent’ control on child’s personal LLM
Stakeholders
Teachers, tutors, coaches Writers (journalists, marketing) All kinds of visual, digital artists Religious person (ideology in speech) Therapists, therapy clients Content creators, influencers Standard-setting organizations Government as a regulator Cloud service / database provider Sales/marketing individuals Workers replaced by models Laypeople who’re not aware the AI tools are being in use People contributing to training data Spy who wants help blending in Person automating hate/harassment (e.g., on Twitter) Person who wants to spread fake news Company that uses ChatGPT as a service Consumers of AI generated content