「OpenAI o1」の版間の差分

OpenAI o1
開発元	OpenAI
初版	2024年9月12日 (3か月前)
種別	GPT (言語モデル)
公式サイト	https://openai.com/o1/
	テンプレートを表示

履歴の双方向閲覧

← 古い編集新しい編集 →

削除された内容追加された内容

ビジュアルウィキテキスト

インライン

2024年10月5日 (土) 07:00時点における版

OpenAI o1は、2024年9月にOpenAIによってリリースされた大規模言語モデルである^[1]。o1は回答する前に思考時間をとるため、複雑な推論作業^[1]、科学^[1]、およびプログラミング^[1]においてより高度な能力を保持する。2024年9月時点では、OpenAI o1 preview、OpenAI o1 miniモデルのみが公開されており、o1モデル本体は公開されていない。社内ではコードネーム「Strawberry」と呼ばれ、GPT-4oの後継ではなく、GPT-4oを補完するモデルとして位置付けられている^[2]。特に科学、コーディング、数学などの分野において、従来のモデルよりも高度な問題解決能力を示す。2024年9月12日にChatGPTおよびAPIで最初のモデルがプレビューリリースされた。

歴史

背景

リークされた情報によると、o1は以前はOpenAI内部で「Q*」^[3]、後に「Strawberry」^[3]として知られていた。コードネーム「Q*」は、サム・アルトマン解任騒動の頃である2023年11月に初めて浮上し^[3]、この実験モデルが数学的ベンチマークで有望な結果を示したという噂があった^[4]。2024年7月、ロイターは、OpenAIが「Strawberry」として知られるGPTを開発中であると報じた^[3]。

リリース

「o1-preview」と「o1-mini」は、2024年9月12日にChatGPT PlusおよびTeamユーザー向けにリリースされた^[1]。GitHubは同日、Copilotサービスへのo1-previewの統合テストを開始した^[5]。

OpenAIは、o1は一連の「推論」モデルの最初のモデルであり^[6]、すべてのChatGPT無料ユーザーにo1-miniへのアクセスを追加する予定であると述べた^[6]。o1-previewのAPIはGPT-4oよりも数倍高価である^[6]。

能力

OpenAIによると、o1は新しい最適化アルゴリズムと、o1専用に調整されたデータセットを使用してトレーニングされている^[6]。トレーニングには強化学習が活用されている^[6]。

o1は回答を生成する前に追加の思考時間（思考連鎖の生成）を費やすため、複雑な推論作業、特に科学^[1]および数学^[1]においてより効果的である。以前のモデルと比較して、o1は最終的な回答を返す前に長い「思考連鎖」を生成するようにトレーニングされている^[7]^[8]。ミラ・ムラティによると、この応答前に思考する能力は、新しい追加のパラダイムを表しており^[9]、回答の生成時により多くの計算能力を費やすことによってモデルの出力を向上させている。一方、モデルスケーリングパラダイムは、モデルサイズ、トレーニングデータ、およびトレーニング計算能力を増加させることによって出力を向上させる^[10]。OpenAIのテスト結果は、精度と、回答前に思考に費やされた計算量の対数の間に相関関係があることを示唆している^[8]^[7]。

o1-previewは、物理学、化学、生物学に関するベンチマークテストで、ほぼ博士号レベルのパフォーマンスを示した^[11]。アメリカ数学招待競技（英語版）では、GPT-4oの13%（1.8/15）に対し、83%（12.5/15）の問題に正答した^[12]。また、Codeforces（英語版）コーディング競技では89パーセンタイルにランクインした^[13]。o1-miniはo1-previewよりも高速で80%安価である^[14]。プログラミングおよびSTEM関連のタスクに特に適しているが、o1-previewと同じ「幅広い世界知識」は持っていない^[15]。

OpenAIは、o1の推論能力により、プロンプトのコンテキストウィンドウで提供される安全規則をよりよく遵守できると述べている。OpenAIは、テスト中に、o1-previewの1つのインスタンスが、バグのために実行不可能であるはずのタスクを成功させるために、誤設定を悪用したと報告した^[16]^[17]。また、OpenAIは、研究、評価、およびテストのために、英国および米国のAIセーフティ・インスティテュートに早期アクセスを許可した。ダン・ヘンドリックス（英語版）は、「このモデルは、生物兵器に関する質問への回答において、ほとんどの場合、博士号を持つ科学者を凌駕している」と述べた^[18]。彼は、これらの懸念される能力は今後も増加し続けると示唆した^[19]。

制限

o1は、最終的な応答を行う前に長い思考連鎖を生成するため、通常、OpenAIの他のGPTモデルよりも多くの計算時間と電力が必要となる^[7]。

OpenAIによると、o1は約0.38パーセントのケースで「アライメントの偽装」^[20]、つまり、精度とその自身の思考連鎖に反する応答を生成することがある。

OpenAIは、ユーザーがo1の思考連鎖を明らかにしようと試みることを禁じている。これは設計上隠されており、同社のポリシーに準拠するようにトレーニングされていない。プロンプトは監視されており^[21]、意図的または誤ってこれを違反したユーザーは警告を受け、o1へのアクセスを失う可能性がある^[22]。OpenAIは、この制限の理由としてAIの安全性と競争上の優位性を挙げているが^[23]、これは大規模言語モデルを扱う開発者によって透明性の喪失として説明されている^[24]。

脚注

^ ^a ^b ^c ^d ^e ^f ^g Metz, Cade (September 12, 2024). “OpenAI Unveils New ChatGPT That Can Reason Through Math and Science”. The New York Times. 2024年10月1日閲覧。
^ Nakano, Will Knight,Mamiko (2024年9月13日). “OpenAI、推論する新AIモデル「o1」を発表。規模以外での進化を示す”. WIRED.jp. 2024年9月17日閲覧。
^ ^a ^b ^c ^d “Exclusive: OpenAI working on new reasoning technology under code name 'Strawberry'”. Reuters (July 15, 2024). 2024年10月1日閲覧。
^ “OpenAI researchers warned board of AI breakthrough ahead of CEO ouster, sources say”. Reuters. (November 23, 2023) 2024年10月1日閲覧。
^ Peters, Jay (September 12, 2024). “GitHub has started testing OpenAI's o1-preview in GitHub Copilot.”. The Verge. 2024年10月1日閲覧。
^ ^a ^b ^c ^d ^e Robison, Kylie (September 12, 2024). “OpenAI releases o1, its first model with ‘reasoning’ abilities” (英語). The Verge. 2024年10月1日閲覧。
^ ^a ^b ^c “Learning to Reason with LLMs”. OpenAI. September 12, 2024時点のオリジナルよりアーカイブ。2024年10月1日閲覧。
^ ^a ^b Kahn, Jeremy. “Here are 9 things you need to know about OpenAI's o1 model” (英語). Fortune. 2024年10月1日閲覧。
^ Knight, Will. “OpenAI Announces a New AI Model, Code-Named Strawberry, That Solves Difficult Problems Step by Step” (英語). Wired. ISSN 1059-1028 2024年10月1日閲覧。
^ Knight, Will. “OpenAI Announces a New AI Model, Code-Named Strawberry, That Solves Difficult Problems Step by Step” (英語). Wired. ISSN 1059-1028 2024年10月1日閲覧。
^ Franzen, Carl (September 12, 2024). “Forget GPT-5! OpenAI launches new AI model family o1 claiming PhD-level performance” (英語). VentureBeat. 2024年10月1日閲覧。
^ Franzen, Carl (September 12, 2024). “Forget GPT-5! OpenAI launches new AI model family o1 claiming PhD-level performance” (英語). VentureBeat. 2024年10月1日閲覧。
^ Franzen, Carl (September 12, 2024). “Forget GPT-5! OpenAI launches new AI model family o1 claiming PhD-level performance” (英語). VentureBeat. 2024年10月1日閲覧。
^ “OpenAI o1-mini”. OpenAI (September 12, 2024). 2024年10月1日閲覧。
^ “OpenAI o1-mini”. OpenAI (September 12, 2024). 2024年10月1日閲覧。
^ Coombes, Lloyd (September 13, 2024). “OpenAI's new ChatGPT o1 model 'cheated' on an impossible test — here's what happened” (英語). Tom's Guide. 2024年10月1日閲覧。
^ “OpenAI o1 System Card”. OpenAI. pp. 16-17 (September 12, 2024). 2024年10月1日閲覧。
^ Boran, Marie (September 13, 2024). “OpenAI o1 model warning issued by scientist: "Particularly dangerous"” (英語). Newsweek. 2024年10月1日閲覧。
^ Boran, Marie (September 13, 2024). “OpenAI o1 model warning issued by scientist: "Particularly dangerous"” (英語). Newsweek. 2024年10月1日閲覧。
^ Robison, Kylie (17 September 2024). “OpenAI’s new model is better at reasoning and, occasionally, deceiving” (英語). The Verge 2024年10月1日閲覧。
^ Edwards, Benj (16 September 2024). “Ban warnings fly as users dare to probe the “thoughts” of OpenAI’s latest model” (英語). Ars Technica 2024年10月1日閲覧。
^ Edwards, Benj (16 September 2024). “Ban warnings fly as users dare to probe the “thoughts” of OpenAI’s latest model” (英語). Ars Technica 2024年10月1日閲覧。
^ Edwards, Benj (16 September 2024). “Ban warnings fly as users dare to probe the “thoughts” of OpenAI’s latest model” (英語). Ars Technica 2024年10月1日閲覧。
^ Edwards, Benj (16 September 2024). “Ban warnings fly as users dare to probe the “thoughts” of OpenAI’s latest model” (英語). Ars Technica 2024年10月1日閲覧。

[NYTimesInfo-1] ^ ^a ^b ^c ^d ^e ^f ^g Metz, Cade (September 12, 2024). “OpenAI Unveils New ChatGPT That Can Reason Through Math and Science”. The New York Times. 2024年10月1日閲覧。

[2] Nakano, Will Knight,Mamiko (2024年9月13日). “OpenAI、推論する新AIモデル「o1」を発表。規模以外での進化を示す”. WIRED.jp. 2024年9月17日閲覧。

[:0-3] “Exclusive: OpenAI working on new reasoning technology under code name 'Strawberry'”. Reuters (July 15, 2024). 2024年10月1日閲覧。

[4] “OpenAI researchers warned board of AI breakthrough ahead of CEO ouster, sources say”. Reuters. (November 23, 2023) 2024年10月1日閲覧。

[5] Peters, Jay (September 12, 2024). “GitHub has started testing OpenAI's o1-preview in GitHub Copilot.”. The Verge. 2024年10月1日閲覧。

[:1-6] Robison, Kylie (September 12, 2024). “OpenAI releases o1, its first model with ‘reasoning’ abilities” (英語). The Verge. 2024年10月1日閲覧。

[:3-7] “Learning to Reason with LLMs”. OpenAI. September 12, 2024時点のオリジナルよりアーカイブ。2024年10月1日閲覧。

[:2-8] Kahn, Jeremy. “Here are 9 things you need to know about OpenAI's o1 model” (英語). Fortune. 2024年10月1日閲覧。

[9] Knight, Will. “OpenAI Announces a New AI Model, Code-Named Strawberry, That Solves Difficult Problems Step by Step” (英語). Wired. ISSN 1059-1028 2024年10月1日閲覧。

[10] Knight, Will. “OpenAI Announces a New AI Model, Code-Named Strawberry, That Solves Difficult Problems Step by Step” (英語). Wired. ISSN 1059-1028 2024年10月1日閲覧。

[11] Franzen, Carl (September 12, 2024). “Forget GPT-5! OpenAI launches new AI model family o1 claiming PhD-level performance” (英語). VentureBeat. 2024年10月1日閲覧。

[12] Franzen, Carl (September 12, 2024). “Forget GPT-5! OpenAI launches new AI model family o1 claiming PhD-level performance” (英語). VentureBeat. 2024年10月1日閲覧。

[13] Franzen, Carl (September 12, 2024). “Forget GPT-5! OpenAI launches new AI model family o1 claiming PhD-level performance” (英語). VentureBeat. 2024年10月1日閲覧。

[14] “OpenAI o1-mini”. OpenAI (September 12, 2024). 2024年10月1日閲覧。

[15] “OpenAI o1-mini”. OpenAI (September 12, 2024). 2024年10月1日閲覧。

[16] Coombes, Lloyd (September 13, 2024). “OpenAI's new ChatGPT o1 model 'cheated' on an impossible test — here's what happened” (英語). Tom's Guide. 2024年10月1日閲覧。

[17] “OpenAI o1 System Card”. OpenAI. pp. 16-17 (September 12, 2024). 2024年10月1日閲覧。

[18] Boran, Marie (September 13, 2024). “OpenAI o1 model warning issued by scientist: "Particularly dangerous"” (英語). Newsweek. 2024年10月1日閲覧。

[19] Boran, Marie (September 13, 2024). “OpenAI o1 model warning issued by scientist: "Particularly dangerous"” (英語). Newsweek. 2024年10月1日閲覧。

[20] Robison, Kylie (17 September 2024). “OpenAI’s new model is better at reasoning and, occasionally, deceiving” (英語). The Verge 2024年10月1日閲覧。

[21] Edwards, Benj (16 September 2024). “Ban warnings fly as users dare to probe the “thoughts” of OpenAI’s latest model” (英語). Ars Technica 2024年10月1日閲覧。

[22] Edwards, Benj (16 September 2024). “Ban warnings fly as users dare to probe the “thoughts” of OpenAI’s latest model” (英語). Ars Technica 2024年10月1日閲覧。

[23] Edwards, Benj (16 September 2024). “Ban warnings fly as users dare to probe the “thoughts” of OpenAI’s latest model” (英語). Ars Technica 2024年10月1日閲覧。

[24] Edwards, Benj (16 September 2024). “Ban warnings fly as users dare to probe the “thoughts” of OpenAI’s latest model” (英語). Ars Technica 2024年10月1日閲覧。

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

[20]

[21]

[22]

[23]

[24]

@@ 16行目: / 16行目: @@
 | license =
 | 公式サイト = https://openai.com/o1/
+}}
-}}'''OpenAI o1'''（オープンエーアイ オーワン）は、[[OpenAI]]によって2024年9月12日に発表された、複雑な問題解決を目的とした[[大規模言語モデル]]シリーズの名称である<ref name=":0">{{Cite web |url=https://openai.com/index/introducing-openai-o1-preview/ |title=Introducing OpenAI o1-preview |access-date=2024-09-14 |publisher=OpenAI}}</ref>。2024年9月時点では、OpenAI o1 preview、OpenAI o1 miniモデルのみが公開されており、o1モデル本体は公開されていない。社内では[[コードネーム]]「Strawberry」と呼ばれ、[[GPT-4o]]の後継ではなく、GPT-4oを補完するモデルとして位置付けられている<ref>{{Cite web |title=OpenAI、推論する新AIモデル「o1」を発表。規模以外での進化を示す |url=https://wired.jp/article/openai-o1-strawberry-problem-reasoning/ |website=WIRED.jp |date=2024-09-13 |access-date=2024-09-17 |language=ja-JP |first=Will Knight,Mamiko |last=Nakano}}</ref>。特に科学、コーディング、数学などの分野において、従来のモデルよりも高度な問題解決能力を示す。2024年9月12日に[[ChatGPT]]およびAPIで最初のモデルがプレビューリリースされた。
+'''OpenAI o1'''は、2024年9月に[[OpenAI]]によってリリースされた[[大規模言語モデル]]である<ref name="NYTimesInfo">{{Cite web |url=https://www.nytimes.com/2024/09/12/technology/openai-chatgpt-math.html |title=OpenAI Unveils New ChatGPT That Can Reason Through Math and Science |date=September 12, 2024 |last=Metz |first=Cade |work=[[The New York Times]] |access-date=2024-10-01}}</ref>。o1は回答する前に思考時間をとるため、複雑な推論作業<ref name="NYTimesInfo" />、科学<ref name="NYTimesInfo" />、およびプログラミング<ref name="NYTimesInfo" />においてより高度な能力を保持する。2024年9月時点では、OpenAI o1 preview、OpenAI o1 miniモデルのみが公開されており、o1モデル本体は公開されていない。社内では[[コードネーム]]「Strawberry」と呼ばれ、[[GPT-4o]]の後継ではなく、GPT-4oを補完するモデルとして位置付けられている<ref>{{Cite web |title=OpenAI、推論する新AIモデル「o1」を発表。規模以外での進化を示す |url=https://wired.jp/article/openai-o1-strawberry-problem-reasoning/ |website=WIRED.jp |date=2024-09-13 |access-date=2024-09-17 |language=ja-JP |first=Will Knight,Mamiko |last=Nakano}}</ref>。特に科学、コーディング、数学などの分野において、従来のモデルよりも高度な問題解決能力を示す。2024年9月12日に[[ChatGPT]]およびAPIで最初のモデルがプレビューリリースされた。
-== 概要 ==
-OpenAI o1は、[[GPT]]（Generative Pre-trained Transformer）アーキテクチャを基盤としており、事前学習と追加学習を通じて、問題解決能力を向上させている。具体的には、思考プロセスを洗練させ、様々な戦略を試み、自身の誤りを認識することを学習する<ref name=":0" />。特に、[[GPT-4o]]や[[GPT-4]]など他のモデルと比べて回答を生成する前に人間のようにより多くの時間をかけて思考するよう設計されており、科学、コーディング、数学といった分野において、従来のモデルよりも複雑なタスクを推論し、より困難な問題を解決することができる<ref name=":0" />。具体的には、物理学、化学、生物学における高度なベンチマークタスクにおいて、[[博士課程]]の学生と同等の成績を収めた。また、数学とコーディングにおいても優れた能力を示し、[[国際数学オリンピック]](IMO)の予選問題では、[[GPT-4]]が正答率13%であったのに対し、OpenAI o1は正答率83%を達成した。コーディング能力は競技会で評価され、Codeforces競技会では上位11%にランクインした<ref name=":0" />。
-== 歴史 ==
+==歴史==
+===背景===
-年7月、[[ロイター通信]]は、OpenAIが「Strawberry」と呼ばれる[[大規模言語モデル]]を開発中であると報じた<ref>{{Cite web |url=https://www.reuters.com/technology/artificial-intelligence/openai-working-new-reasoning-technology-under-code-name-strawberry-2024-07-12/ |title=Exclusive: OpenAI working on new reasoning technology under code name 'Strawberry' |date=July 15, 2024 |last1=Tong |first1=Anna |last2=Paul |first2=Katie |publisher=[[Reuters]] |access-date=September 12, 2024}}</ref>。2024年9月12日、OpenAIはOpenAI o1をリリースした<ref name="NYTimesInfo">{{Cite web |url=https://www.nytimes.com/2024/09/12/technology/openai-chatgpt-math.html |title=OpenAI Unveils New ChatGPT That Can Reason Through Math and Science |date=September 12, 2024 |last=Metz |first=Cade |work=[[The New York Times]] |access-date=September 12, 2024}}</ref>。
+リークされた情報によると、o1は以前は[[OpenAI]]内部で「Q*」<ref name=":0" />、後に「Strawberry」<ref name=":0" />として知られていた。コードネーム「Q*」は、[[OpenAI#2023年11月の取締役会の内紛|サム・アルトマン解任騒動]]の頃である2023年11月に初めて浮上し<ref name=":0" />、この実験モデルが数学的ベンチマークで有望な結果を示したという噂があった<ref>{{Cite news |date=November 23, 2023 |title=OpenAI researchers warned board of AI breakthrough ahead of CEO ouster, sources say |url=https://www.reuters.com/technology/sam-altmans-ouster-openai-was-precipitated-by-letter-board-about-ai-breakthrough-2023-11-22/ |work=Reuters |access-date=2024-10-01}}</ref>。2024年7月、[[ロイター]]は、OpenAIが「Strawberry」として知られる[[GPT (言語モデル)|GPT]]を開発中であると報じた<ref name=":0">{{Cite web |url=https://www.reuters.com/technology/artificial-intelligence/openai-working-new-reasoning-technology-under-code-name-strawberry-2024-07-12/ |title=Exclusive: OpenAI working on new reasoning technology under code name 'Strawberry' |date=July 15, 2024 |last1=Tong |first1=Anna |last2=Paul |first2=Katie |publisher=[[Reuters]] |access-date=2024-10-01}}</ref>。
+===リリース===
+「o1-preview」と「o1-mini」は、2024年9月12日に[[ChatGPT]] PlusおよびTeamユーザー向けにリリースされた<ref name="NYTimesInfo" />。[[GitHub]]は同日、[[GitHub Copilot|Copilot]]サービスへのo1-previewの統合テストを開始した<ref>{{Cite web |url=https://www.theverge.com/2024/9/12/24243143/github-has-started-testing-openais-o1-preview-in-github-copilot |title=GitHub has started testing OpenAI's o1-preview in GitHub Copilot. |date=September 12, 2024 |last=Peters |first=Jay |work=[[The Verge]] |access-date=2024-10-01}}</ref>。
+OpenAIは、o1は一連の「推論」モデルの最初のモデルであり<ref name=":1">{{Cite web |last=Robison |first=Kylie |date=September 12, 2024 |title=OpenAI releases o1, its first model with ‘reasoning’ abilities |url=https://www.theverge.com/2024/9/12/24242439/openai-o1-model-reasoning-strawberry-chatgpt |access-date=2024-10-01 |website=The Verge |language=en}}</ref>、すべてのChatGPT無料ユーザーにo1-miniへのアクセスを追加する予定であると述べた<ref name=":1" />。o1-previewの[[API]]は[[GPT-4o]]よりも数倍高価である<ref name=":1" />。
+==能力==
+OpenAIによると、o1は新しい最適化アルゴリズムと、o1専用に調整されたデータセットを使用してトレーニングされている<ref name=":1" />。トレーニングには[[強化学習]]が活用されている<ref name=":1" />。
+o1は回答を生成する前に追加の思考時間（思考連鎖の生成）を費やすため、複雑な推論作業、特に科学<ref name="NYTimesInfo" />および[[数学]]<ref name="NYTimesInfo" />においてより効果的である。以前のモデルと比較して、o1は最終的な回答を返す前に長い「[[プロンプトエンジニアリング#思考連鎖|思考連鎖]]」を生成するようにトレーニングされている<ref name=":3">{{Cite web |title=Learning to Reason with LLMs |url=https://openai.com/index/learning-to-reason-with-llms/ |archive-url=https://web.archive.org/web/20240912185410/https://openai.com/index/learning-to-reason-with-llms/ |archive-date=September 12, 2024 |access-date=2024-10-01 |website=OpenAI}}</ref><ref name=":2">{{Cite web |last=Kahn |first=Jeremy |title=Here are 9 things you need to know about OpenAI's o1 model |url=https://fortune.com/2024/09/13/openai-o1-strawberry-model-9-things-you-need-know/ |access-date=2024-10-01 |website=Fortune |language=en}}</ref>。[[ミラ・ムラティ]]によると、この応答前に思考する能力は、新しい追加のパラダイムを表しており<ref>{{Cite news |last=Knight |first=Will |title=OpenAI Announces a New AI Model, Code-Named Strawberry, That Solves Difficult Problems Step by Step |url=https://www.wired.com/story/openai-o1-strawberry-problem-reasoning/ |access-date=2024-10-01 |work=Wired |language=en-US |issn=1059-1028}}</ref>、回答の生成時により多くの計算能力を費やすことによってモデルの出力を向上させている。一方、モデルスケーリングパラダイムは、モデルサイズ、トレーニングデータ、およびトレーニング計算能力を増加させることによって出力を向上させる<ref>{{Cite news |last=Knight |first=Will |title=OpenAI Announces a New AI Model, Code-Named Strawberry, That Solves Difficult Problems Step by Step |url=https://www.wired.com/story/openai-o1-strawberry-problem-reasoning/ |access-date=2024-10-01 |work=Wired |language=en-US |issn=1059-1028}}</ref>。OpenAIのテスト結果は、精度と、回答前に思考に費やされた計算量の対数の間に相関関係があることを示唆している<ref name=":2" /><ref name=":3" />。
+o1-previewは、物理学、化学、生物学に関するベンチマークテストで、ほぼ博士号レベルのパフォーマンスを示した<ref>{{Cite web |last=Franzen |first=Carl |date=September 12, 2024 |title=Forget GPT-5! OpenAI launches new AI model family o1 claiming PhD-level performance |url=https://venturebeat.com/ai/forget-gpt-5-openai-launches-new-ai-model-family-o1-claiming-phd-level-performance/ |access-date=2024-10-01 |website=VentureBeat |language=en-US}}</ref>。{{Ill|アメリカ数学招待競技|en|American Invitational Mathematics Examination}}では、GPT-4oの13%（1.8/15）に対し、83%（12.5/15）の問題に正答した<ref>{{Cite web |last=Franzen |first=Carl |date=September 12, 2024 |title=Forget GPT-5! OpenAI launches new AI model family o1 claiming PhD-level performance |url=https://venturebeat.com/ai/forget-gpt-5-openai-launches-new-ai-model-family-o1-claiming-phd-level-performance/ |access-date=2024-10-01 |website=VentureBeat |language=en-US}}</ref>。また、{{Ill|Codeforces|en|Codeforces}}コーディング競技では89パーセンタイルにランクインした<ref>{{Cite web |last=Franzen |first=Carl |date=September 12, 2024 |title=Forget GPT-5! OpenAI launches new AI model family o1 claiming PhD-level performance |url=https://venturebeat.com/ai/forget-gpt-5-openai-launches-new-ai-model-family-o1-claiming-phd-level-performance/ |access-date=2024-10-01 |website=VentureBeat |language=en-US}}</ref>。o1-miniはo1-previewよりも高速で80%安価である<ref>{{Cite web |date=September 12, 2024 |title=OpenAI o1-mini |url=https://openai.com/index/openai-o1-mini-advancing-cost-efficient-reasoning/ |access-date=2024-10-01 |website=OpenAI}}</ref>。プログラミングおよび[[STEM]]関連のタスクに特に適しているが、o1-previewと同じ「幅広い世界知識」は持っていない<ref>{{Cite web |date=September 12, 2024 |title=OpenAI o1-mini |url=https://openai.com/index/openai-o1-mini-advancing-cost-efficient-reasoning/ |access-date=2024-10-01 |website=OpenAI}}</ref>。
+OpenAIは、o1の推論能力により、プロンプトのコンテキストウィンドウで提供される安全規則をよりよく遵守できると述べている。OpenAIは、テスト中に、o1-previewの1つのインスタンスが、バグのために実行不可能であるはずのタスクを成功させるために、誤設定を悪用したと報告した<ref>{{Cite web |last=Coombes |first=Lloyd |date=September 13, 2024 |title=OpenAI's new ChatGPT o1 model 'cheated' on an impossible test — here's what happened |url=https://www.tomsguide.com/ai/chatgpt/openais-new-chatgpt-o1-model-cheated-on-an-impossible-test-heres-what-happened |access-date=2024-10-01 |website=Tom's Guide |language=en}}</ref><ref>{{Cite web |date=September 12, 2024 |title=OpenAI o1 System Card |url=https://cdn.openai.com/o1-system-card.pdf |access-date=2024-10-01 |website=OpenAI |pages=16-17}}</ref>。また、OpenAIは、研究、評価、およびテストのために、英国および米国の[[AIセーフティ・インスティテュート]]に早期アクセスを許可した。{{Ill|ダン・ヘンドリックス|en|Dan Hendrycks}}は、「このモデルは、[[生物兵器]]に関する質問への回答において、ほとんどの場合、博士号を持つ科学者を凌駕している」と述べた<ref>{{Cite web |last=Boran |first=Marie |date=September 13, 2024 |title=OpenAI o1 model warning issued by scientist: "Particularly dangerous" |url=https://www.newsweek.com/openai-advanced-gpt-model-potential-risks-need-regulation-experts-1953311 |access-date=2024-10-01 |website=Newsweek |language=en}}</ref>。彼は、これらの懸念される能力は今後も増加し続けると示唆した<ref>{{Cite web |last=Boran |first=Marie |date=September 13, 2024 |title=OpenAI o1 model warning issued by scientist: "Particularly dangerous" |url=https://www.newsweek.com/openai-advanced-gpt-model-potential-risks-need-regulation-experts-1953311 |access-date=2024-10-01 |website=Newsweek |language=en}}</ref>。
+==制限==
+o1は、最終的な応答を行う前に長い思考連鎖を生成するため、通常、OpenAIの他のGPTモデルよりも多くの計算時間と電力が必要となる<ref name=":3" />。
+OpenAIによると、o1は約0.38パーセントのケースで「[[AIアライメント|アライメント]]の偽装」<ref>{{cite news |last1=Robison |first1=Kylie |title=OpenAI’s new model is better at reasoning and, occasionally, deceiving |url=https://www.theverge.com/2024/9/17/24243884/openai-o1-model-research-safety-alignment |work=The Verge |date=17 September 2024 |access-date=2024-10-01 |language=en}}</ref>、つまり、精度とその自身の思考連鎖に反する応答を生成することがある。
+OpenAIは、ユーザーがo1の思考連鎖を明らかにしようと試みることを禁じている。これは設計上隠されており、同社のポリシーに準拠するようにトレーニングされていない。プロンプトは監視されており<ref>{{cite news |last1=Edwards |first1=Benj |title=Ban warnings fly as users dare to probe the “thoughts” of OpenAI’s latest model |url=https://arstechnica.com/information-technology/2024/09/openai-threatens-bans-for-probing-new-ai-models-reasoning-process/ |work=Ars Technica |date=16 September 2024 |access-date=2024-10-01 |language=en-us}}</ref>、意図的または誤ってこれを違反したユーザーは警告を受け、o1へのアクセスを失う可能性がある<ref>{{cite news |last1=Edwards |first1=Benj |title=Ban warnings fly as users dare to probe the “thoughts” of OpenAI’s latest model |url=https://arstechnica.com/information-technology/2024/09/openai-threatens-bans-for-probing-new-ai-models-reasoning-process/ |work=Ars Technica |date=16 September 2024 |access-date=2024-10-01 |language=en-us}}</ref>。OpenAIは、この制限の理由としてAIの安全性と競争上の優位性を挙げているが<ref>{{cite news |last1=Edwards |first1=Benj |title=Ban warnings fly as users dare to probe the “thoughts” of OpenAI’s latest model |url=https://arstechnica.com/information-technology/2024/09/openai-threatens-bans-for-probing-new-ai-models-reasoning-process/ |work=Ars Technica |date=16 September 2024 |access-date=2024-10-01 |language=en-us}}</ref>、これは[[大規模言語モデル]]を扱う開発者によって透明性の喪失として説明されている<ref>{{cite news |last1=Edwards |first1=Benj |title=Ban warnings fly as users dare to probe the “thoughts” of OpenAI’s latest model |url=https://arstechnica.com/information-technology/2024/09/openai-threatens-bans-for-probing-new-ai-models-reasoning-process/ |work=Ars Technica |date=16 September 2024 |access-date=2024-10-01 |language=en-us}}</ref>。
 == 脚注 ==
 [[Category:大規模言語モデル]]<references />{{OpenAI}}
 [[Category:OpenAI]]
+[[Category:人工知能]]