11/3-11/16 AI 新知

Our vision to bring Microsoft Copilot to everyone, and more / Microsoft Copilot 微軟的新網站，看來要把 Bing Chat 提升到全公司層級 (而不只是 Bing 之下的一個子功能)，並支援 OpenAI GPTs，還有 Copilot 外掛

ChatGPT - 17,500+ Best Custom GPTs 靠這一個 GPTs 搜尋有哪些 GPTs 可用

Scaling multimodal understanding to long videos Google 的 Mirasol3B 高效影片理解模型

PixArt-α Online Stable Diffusion 推出的高效文生圖模型

Try out the Blazing Fast LCM Lora SD 1.5 in your browser 生圖超快，可邊打字邊生成圖片

Vaibhavs10/insanely-fast-whisper 超快速 Whisper 實作 (Large v3 於 98 秒內轉 2.5 小時音訊)

Smarter Summaries w/ Finetuning GPT-3.5 and Chain of Density - Instructor 微調 GPT-3.5 能產生 GPT-4 等級的摘要詳細方法 (由大摘要濃縮成小摘要)

langchain-ai/opengpts 開源的 GPTs 實作

Tokenwiz 視覺化理解切 Token 的結果

Cursorless is alien magic from the future 完全用語音寫程式

The Normal Blog - Infinite Context LLMs: Going Beyond RAG with Extended Minds 不需微調也能超越 RAG，在 Attention 節點可提取外部記憶的方法

Music ControlNet 更細緻控制音樂 (旋律、動態與節奏) 生成的方法

Hacking Google Bard - From Prompt Injection to Data Exfiltration 讓 Bard 資料洩露的攻擊方法 (已修正漏洞)

A recent LLM hallucination benchmark is making rounds, and people are jumping to conclusions based on a table screenshot. The eval is so problematic in many ways 說最近的 LLM 幻覺比較無意義，最簡單的實作就是直接抄原文幾句話，保證無幻覺

GPU Survival Toolkit for the AI age: The bare minimum every developer must know GPU 程式基礎知識

GraphCast: AI model for faster and more accurate global weather forecasting Google 的全球氣象預測準確度已經可超越傳統數值模擬的預測結果，而且快得多

Copilot is an Incumbent Business Model Copilot 只是現有工具的加強，並不是顛覆性創新 (創造新市場或超低成本)

The Future of Music: How Generative AI Is Transforming the Music Industry AI 如何改變音樂產業，由即時串流到專業工具，很全面的分析

Inside OpenAI: How does ChatGPT Ship So Quickly? OpenAI 工程部門帶領者，談如何結合最尖端研究員，仍能快速推出產品的軟體工程實踐 (可惜後半文章要付費訂閱才能閱讀)

Don’t Build AI Products The Way Everyone Else Is Doing It 作者建議要靠程式邏輯，再加儘可能少的 AI 來獲得完整解決方案 (不要只是 ChatGPT Wrapper)

Large Language Models Understand and Can be Enhanced by Emotional Stimuli 利用心理學讓 LLM 變得更強，如使用提示「這對我的工作很重要」。我在想那用情緒勒索的方式呢?

Is the Reversal Curse Real? 先前有論文說 LLM 無法由「A 是 B」推論出「B 是 A」，這篇則提出反對意見也有詳細測試，其中提供獨家簡單又有效的訓練方法。

Beyond Memorization: Violating Privacy Via Inference with Large Language Models LLM 可以透過對話的蛛絲馬跡推論出個人隱私，看這篇好像有趣的偵探故事，可以看最後面 I.3 的聊天紀錄範例，LLM 如何跟人對談吊出隱私資訊...

Role play with large language models 不要將 LLM 擬人化 (不要誤以為 LLM 有意識)，此文提出角色扮演概念，還能扯到「可能角色的多重宇宙中的擬像的疊加」，也就是下一個字有多種機率選擇，每個選擇將導致多個結果版本，就好像物理學的多重宇宙觀念一樣。

Levels of AGI: Operationalizing Progress on the Path to AGI Google 提出 AGI 的分級 (類似自駕分級): L0: 非 AI；L1: 與一般人相當；L2: 超越 50% 專家；L3: 超越 90% 專家；L4: 超越 99% 專家，還分狹窄領域或廣泛領域。裡面提到 Google AlphaFold (蛋白質分析) AlphaZero (圍棋) 是狹窄領域 L4，而 ChatGPT 只有廣泛領域 L1，看來還有好長一段路啊...

Thanks for reading Grounded till Now ↣ Infer the Future! Subscribe for free to receive new posts and support my work.