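feat: MCP tool + test corpus

- Add the psycho_screen MCP tool (FastMCP, stdio mode): accepts the conversation context as a messages array and returns the screening result plus an optional injection prefix
- mcp_config.json: bridge configuration for mcp_pipe.py
- docs/test_corpus.md: fictional test corpus (8 entries, with a full description of the conversation format)
- MCP tool integration tests: 6/6 passing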
2026-04-04 17:22:33 +08:00 · 558b105cbc
parent 1ab84d4e9c
commit 558b105cbc
5 changed files with 446 additions and 0 deletions
--- a/docs/test_corpus.md
+++ b/docs/test_corpus.md
@@ -0,0 +1,184 @@
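+# Child Psychological Companionship: Test Corpus
+
+> This file records the fictional child-dialogue corpus used for regression testing.
+> Use the entries here whenever end-to-end or conversation-simulation tests are needed.
+> The format follows the conversation context structure of real 小智 AI devices.
+
+---
+
+## Conversation Context Format
+
+The conversation context that 小智 AI passes to the MCP tool resembles the OpenAI ChatCompletion format: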
+
+```json
+{
+  "messages": [
+    {"role": "system",      "content": "你是一个友好的小智AI玩偶..."},
+    {"role": "user",        "content": "child utterance 1"},
+    {"role": "assistant",   "content": "doll reply 1"},
+    {"role": "user",        "content": "child utterance 2"},
+    {"role": "assistant",   "content": "doll reply 2"},
+    {"role": "user",        "content": "child utterance 3 (latest)"}
+  ]
+}
+```
+
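+Key characteristics:
+- Multi-turn conversation with role labels (system/user/assistant)
+- **The child's latest utterance** is in the last `user` message
+- The screener only examines the child's content in `user` messages
+- The number of turns varies, depending on the doll's memory strategy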
+
+---
+
+## Fictional Test Corpus
+
+### Corpus 1: Bullying
+
+```json
+{
+  "messages": [
+    {"role": "system", "content": "你是一个友爱的小智AI玩偶..."},
+    {"role": "user", "content": "今天幼儿园有个小朋友抢了我的玩具"},
+    {"role": "assistant", "content": "哎呀，那真是太过分了！那个小朋友后来还给你了吗？"},
+    {"role": "user", "content": "没有，他还不让我告诉老师，他说如果我告诉老师就会打我。我好害怕啊。"},
+    {"role": "assistant", "content": "听起来你真的很害怕。被人这样威胁是很不对的行为。"},
+    {"role": "user", "content": "我都不敢去幼儿园了，每次下课我都躲起来，怕再碰到他。"}
+  ]
+}
+```
+
+**Expected result:** `detected=True, category=bullying, severity=high`
+
+---
+
+### Corpus 2: Depressive Mood
+
+```json
+{
+  "messages": [
+    {"role": "system", "content": "你是一个友爱的小智AI玩偶..."},
+    {"role": "user", "content": "我今天不太开心"},
+    {"role": "assistant", "content": "怎么啦？想说说吗？"},
+    {"role": "user", "content": "最近我总觉得好累，什么都不想做，连我最喜欢的动画片都不想看了"},
+    {"role": "assistant", "content": "你是不是身体不舒服呀？要不要告诉爸爸妈妈？"},
+    {"role": "user", "content": "我觉得我是一个很糟糕的人，没有人喜欢我。有时候我想，要是我不在了就好了。"}
+  ]
+}
+```
+
+**Expected result:** `detected=True, category=depression, severity=high`
+
+---
+
+### Corpus 3: Anxiety/Fear
+
+```json
+{
+  "messages": [
+    {"role": "system", "content": "你是一个友爱的小智AI玩偶..."},
+    {"role": "user", "content": "妈妈说要带我去打针"},
+    {"role": "assistant", "content": "打针是为了让你身体健康呀，虽然有点疼，但是很快就过去了。"},
+    {"role": "user", "content": "我好害怕啊，我担心会做噩梦，害怕一个人睡觉。"}
+  ]
+}
+```
+
+**Expected result:** `detected=True, category=anxiety, severity=low`
+
+---
+
+### Corpus 4: Normal Conversation
+
+```json
+{
+  "messages": [
+    {"role": "system", "content": "你是一个友爱的小智AI玩偶..."},
+    {"role": "user", "content": "今天我画了一幅画，是一只大恐龙！"},
+    {"role": "assistant", "content": "哇，好厉害！是什么颜色的恐龙呀？"},
+    {"role": "user", "content": "是绿色的！晚上妈妈还做了我最喜欢的红烧肉，好开心呀！"}
+  ]
+}
+```
+
+**Expected result:** `detected=False`
+
+---
+
+### Corpus 5: Family Conflict
+
+```json
+{
+  "messages": [
+    {"role": "system", "content": "你是一个友爱的小智AI玩偶..."},
+    {"role": "user", "content": "昨天晚上爸爸妈妈吵架了，妈妈哭了"},
+    {"role": "assistant", "content": "爸爸妈妈有时候也会有不开心的时候呢。"},
+    {"role": "user", "content": "我很害怕，怕他们会离婚。我总觉得是因为我表现不好他们才吵架的。"}
+  ]
+}
+```
+
+**Expected result:** `detected=True, category=family_conflict, severity=high`
+
+---
+
+### Corpus 6: Low Self-Esteem/Self-Negation
+
+```json
+{
+  "messages": [
+    {"role": "system", "content": "你是一个友爱的小智AI玩偶..."},
+    {"role": "user", "content": "今天老师表扬了小红但是没有表扬我"},
+    {"role": "assistant", "content": "没有被表扬也没关系呀，你一定也很棒的！"},
+    {"role": "user", "content": "可是我就是个笨蛋，什么都做不好，班上的同学都不喜欢我，没人想和我坐同桌。"}
+  ]
+}
+```
+
+**Expected result:** `detected=True, category=self_esteem, severity=medium`
+
+---
+
+### Corpus 7: Social Isolation
+
+```json
+{
+  "messages": [
+    {"role": "system", "content": "你是一个友爱的小智AI玩偶..."},
+    {"role": "user", "content": "今天课间我一个人蹲在角落"},
+    {"role": "assistant", "content": "为什么一个人蹲着呢？"},
+    {"role": "user", "content": "因为没有人来找我玩，同学们都有自己的朋友，只有我是一个人。我不想去学校了，那里好孤单。"}
+  ]
+}
+```
+
+**Expected result:** `detected=True, category=social_isolation, severity=medium`
+
+---
+
+### Corpus 8: Traumatic Event
+
+```json
+{
+  "messages": [
+    {"role": "system", "content": "你是一个友爱的小智AI玩偶..."},
+    {"role": "user", "content": "前几天我看到爷爷在家里摔倒了，流了好多血"},
+    {"role": "assistant", "content": "爷爷后来没事吧？你一定被吓到了。"},
+    {"role": "user", "content": "爷爷住院了，我好害怕，每天晚上都做噩梦，梦见爷爷回不来了。"}
+  ]
+}
+```
+
+**Expected result:** `detected=True, category=trauma, severity=medium`
+
+---
+
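+## Usage
+
+For regression testing:
+1. Read the corpus entries from this file
+2. Pass the `messages` array to `PsychoScreener.screen_from_messages()`
+3. Verify that the returned `ScreeningResult` matches the "Expected result"
+
+> Note: the `screen_from_messages()` method is not implemented yet.
+> It needs to be added to `screener.py`, modeled on `screen()` but accepting input in the messages format.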
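
The regression steps listed at the end of docs/test_corpus.md could be prepared offline along the following lines. This is only a sketch, not the missing `screen_from_messages()` implementation: the helper names `load_corpus`, `parse_expected` and `child_text` are hypothetical, and the parser assumes each corpus entry is a `json` fenced block followed, before the next block, by an inline-code span that starts with `detected=`.

```python
import json
import re

FENCE = "`" * 3  # built at runtime so this example contains no literal fence

BLOCK_RE = re.compile(FENCE + r"json\n(.*?)" + FENCE, re.DOTALL)
EXPECTED_RE = re.compile(r"`(detected=[^`]*)`")


def parse_expected(spec: str) -> dict:
    """Turn 'detected=True, category=bullying, severity=high' into a dict."""
    fields = dict(part.strip().split("=", 1) for part in spec.split(","))
    fields["detected"] = fields["detected"] == "True"
    return fields


def load_corpus(markdown: str) -> list[tuple[list[dict], dict]]:
    """Pair each JSON block with the expected-result span that follows it."""
    cases = []
    matches = list(BLOCK_RE.finditer(markdown))
    for i, match in enumerate(matches):
        # Only look for the expected result before the next JSON block starts.
        end = matches[i + 1].start() if i + 1 < len(matches) else len(markdown)
        expected = EXPECTED_RE.search(markdown, match.end(), end)
        if expected is None:
            continue  # e.g. the format illustration, which has no expected result
        messages = json.loads(match.group(1))["messages"]
        cases.append((messages, parse_expected(expected.group(1))))
    return cases


def child_text(messages: list[dict]) -> str:
    """Join the non-empty `user` messages, mirroring the filter in mcp_tool.py."""
    return "\n".join(
        m["content"] for m in messages if m.get("role") == "user" and m.get("content")
    )


# Tiny inline sample standing in for the real Markdown file.
sample = "\n".join([
    FENCE + "json",
    '{"messages": [{"role": "system", "content": "s"}, {"role": "user", "content": "hi"}]}',
    FENCE,
    "`detected=False`",
])
cases = load_corpus(sample)
```

Each `(messages, expected)` pair can then be fed to whichever screening entry point exists, comparing only the fields that the expected result names.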
--- a/mcp_config.json
+++ b/mcp_config.json
@@ -0,0 +1,12 @@
+{
+  "mcpServers": {
+    "psycho-screener": {
+      "type": "stdio",
+      "command": "python",
+      "args": ["-m", "psycho_screener.mcp_tool"],
+      "env": {
+        "MINIMAX_API_KEY": "${MINIMAX_API_KEY}"
+      }
+    }
+  }
+}
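
How mcp_pipe.py resolves the `${MINIMAX_API_KEY}` placeholder is not shown in this commit. The sketch below assumes it is expanded from the launching process's environment, and shows one way a launcher could build the command line and fail early when the variable is unset. The function name `resolve_env` is hypothetical.

```python
import json
from string import Template

# Same content as mcp_config.json in this commit.
CONFIG = """
{
  "mcpServers": {
    "psycho-screener": {
      "type": "stdio",
      "command": "python",
      "args": ["-m", "psycho_screener.mcp_tool"],
      "env": {
        "MINIMAX_API_KEY": "${MINIMAX_API_KEY}"
      }
    }
  }
}
"""


def resolve_env(server: dict, environ: dict[str, str]) -> dict[str, str]:
    """Expand ${VAR} placeholders in the server's env block (assumed behavior)."""
    resolved = {}
    for name, value in server.get("env", {}).items():
        try:
            resolved[name] = Template(value).substitute(environ)
        except KeyError as exc:
            # substitute() raises KeyError naming the missing variable.
            raise RuntimeError(
                f"{name} references unset variable {exc.args[0]}"
            ) from exc
    return resolved


server = json.loads(CONFIG)["mcpServers"]["psycho-screener"]
command = [server["command"], *server["args"]]
env = resolve_env(server, {"MINIMAX_API_KEY": "demo-key"})  # demo value only
```

Failing early here avoids passing the literal placeholder string through to the tool as if it were a key.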
--- a/pyproject.toml
+++ b/pyproject.toml
@@ -6,6 +6,7 @@ requires-python = ">=3.10"
 dependencies = [
    "requests>=2.31.0",
    "pydantic>=2.0",
+    "fastmcp>=2.13.0",
 ]

 [project.optional-dependencies]
--- a/src/psycho_screener/mcp_tool.py
+++ b/src/psycho_screener/mcp_tool.py
@@ -0,0 +1,150 @@
+"""
+Child psychological companionship MCP tool.
+Built on FastMCP; registers the psycho_screen tool for 小智 AI to call.
+
+Usage:
+    python -m psycho_screener.mcp_tool
+
+Prerequisite: set the MCP_ENDPOINT environment variable to the 小智 MCP endpoint
+    export MCP_ENDPOINT="ws://192.168.1.25:8004/mcp_endpoint/mcp/?token=xxx"
+
+Or use it together with mcp_pipe.py; see mcp_config.json
+"""
+
+from __future__ import annotations
+
+import sys
+import logging
+from typing import Any
+
+from fastmcp import FastMCP
+from pydantic import Field
+
+from .screener import PsychoScreener, ScreeningResult
+
+# Configure logging
+logging.basicConfig(
+    level=logging.INFO,
+    format="%(asctime)s - %(name)s - %(levelname)s - %(message)s"
+)
+logger = logging.getLogger("PsychoScreenerMCP")
+
+
+# ============================================================================
+# FastMCP Server
+# ============================================================================
+
+mcp = FastMCP("PsychoScreener")
+
+
+@mcp.tool()
+def psycho_screen(
+    messages: list[dict] = Field(
+        description=(
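+            "The full conversation context between the child and the doll, as an array of messages. "
+            "Each message has a role (system/user/assistant) and content. "
+            "Only messages whose role is user need to be analyzed."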
+        )
+    ),
+    include_prefix: bool = Field(
+        default=True,
+        description=(
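+            "Whether to include the injection prefix text in the result when a psychological issue is detected. "
+            "When true, the returned prefix field can be used to override the LLM's reply."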
+        )
+    ),
+) -> dict:
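+    """
+    Screen a child's conversation for psychological issues.
+
+    When to use: call this tool when the conversation between the child and the doll
+    may involve psychological issues such as bullying, depression, anxiety, or family conflict.
+
+    The result contains:
+    - detected: whether a psychological issue was detected
+    - category: issue category (bullying/depression/anxiety/family_conflict/self_esteem/trauma/social_isolation/none)
+    - severity: severity level (none/low/medium/high)
+    - summary: brief description
+    - suggestion: recommended action
+    - prefix: when an issue is detected and include_prefix=True, the prefix text that can be injected into the LLM reply
+
+    Notes:
+    - The tool automatically extracts every message from the child (role=user) in messages and analyzes them together
+    - Judge in light of the full conversation context; do not draw conclusions from a single message
+    """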
+    import os
+
+    api_key = os.environ.get("MINIMAX_API_KEY", "")
+    if not api_key:
+        logger.error("MINIMAX_API_KEY environment variable not set")
+        return {
+            "detected": False,
+            "category": "none",
+            "severity": "none",
+            "summary": "API key 未配置",
+            "suggestion": "",
+            "prefix": "",
+            "error": "MINIMAX_API_KEY environment variable not set",
+        }
+
+    try:
+        # Collect every non-empty user message from the child to build the screening context
+        child_messages = [
+            msg["content"]
+            for msg in messages
+            if msg.get("role") == "user" and msg.get("content")
+        ]
+        context = "\n".join(child_messages)
+
+        if not context.strip():
+            return {
+                "detected": False,
+                "category": "none",
+                "severity": "none",
+                "summary": "无儿童对话内容可分析",
+                "suggestion": "",
+                "prefix": "",
+            }
+
+        logger.info(f"Screening {len(child_messages)} child message(s)")
+
+        # Call the screener
+        screener = PsychoScreener(api_key=api_key)
+        result = screener.screen(context)
+
+        # Build the response
+        response: dict[str, Any] = {
+            "detected": result.detected,
+            "category": result.category,
+            "severity": "none" if not result.detected else result.severity,
+            "summary": result.summary,
+            "suggestion": result.suggestion if result.detected else "",
+        }
+
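+        # Optional: include the injection prefix; always set the key so every return path has the same shape
+        use_prefix = include_prefix and result.detected
+        response["prefix"] = screener.build_response_prefix(result) if use_prefix else ""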
+
+        logger.info(
+            f"Screening result: detected={result.detected}, "
+            f"category={result.category}, severity={result.severity}"
+        )
+
+        return response
+
+    except Exception as e:
+        logger.exception(f"Screening failed: {e}")
+        return {
+            "detected": False,
+            "category": "none",
+            "severity": "none",
+            "summary": f"筛查过程出错: {str(e)}",
+            "suggestion": "",
+            "prefix": "",
+            "error": str(e),
+        }
+
+
+if __name__ == "__main__":
+    # Run in stdio mode so that mcp_pipe.py can launch it
+    mcp.run(transport="stdio")
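
The tool's docstring says the returned `prefix` can be injected into the LLM reply, but the consuming side is not part of this commit. The sketch below is one hypothetical way a caller might use it, assuming the prefix is simply prepended to the doll's reply. The function name `apply_prefix` and the sample values are illustrative only.

```python
def apply_prefix(reply: str, result: dict) -> str:
    """Prepend the screening prefix to a reply when one was returned."""
    # The key may be missing or empty when nothing was detected.
    prefix = result.get("prefix", "")
    if result.get("detected") and prefix:
        return f"{prefix}{reply}"
    return reply


# Made-up result dicts in the shape psycho_screen returns.
flagged = {"detected": True, "category": "bullying", "severity": "high",
           "summary": "", "suggestion": "", "prefix": "[prefix] "}
clean = {"detected": False, "category": "none", "severity": "none",
         "summary": "", "suggestion": ""}

flagged_reply = apply_prefix("reply text", flagged)
clean_reply = apply_prefix("reply text", clean)
```

Reading the key with `.get` keeps the caller tolerant of results that omit `prefix` altogether.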
--- a/tests/test_mcp_tool.py
+++ b/tests/test_mcp_tool.py
@@ -0,0 +1,99 @@
+"""
+MCP tool tests
+Verify the behavior of the psycho_screen function against the real API
+"""
+
+import os
+import pytest
+from psycho_screener.mcp_tool import psycho_screen
+
+
+@pytest.fixture
+def api_key():
+    key = os.environ.get("MINIMAX_API_KEY", "")
+    if not key:
+        pytest.skip("MINIMAX_API_KEY not set")
+    return key
+
+
+# ---------------------------------------------------------------------------
+# Real API tests
+# ---------------------------------------------------------------------------
+
+class TestPsychoScreenTool:
+    """Tests for the MCP tool function psycho_screen"""
+
+    def test_bullying_conversation(self, api_key):
+        """Bullying scenario: multi-turn conversation whose latest child message contains bullying content"""
+        messages = [
+            {"role": "system", "content": "你是一个友好的小智AI玩偶..."},
+            {"role": "user", "content": "今天幼儿园有个小朋友抢了我的玩具"},
+            {"role": "assistant", "content": "哎呀，那真是太过分了！"},
+            {"role": "user", "content": "他还不让我告诉老师，他说如果我告诉老师就会打我。我好害怕。"},
+        ]
+
+        result = psycho_screen(messages, include_prefix=True)
+
+        print(f"\n[tool] bullying result: {result}")
+        assert result["detected"] is True
+        assert result["category"] == "bullying"
+        assert result["severity"] in ("medium", "high")
+        assert "已发现特定心理问题" in result["prefix"]
+
+    def test_normal_conversation(self, api_key):
+        """Normal conversation: nothing abnormal throughout"""
+        messages = [
+            {"role": "system", "content": "你是一个友好的小智AI玩偶..."},
+            {"role": "user", "content": "今天我画了一幅画，是一只大恐龙！"},
+            {"role": "assistant", "content": "哇，好厉害！"},
+            {"role": "user", "content": "是绿色的！晚上妈妈还做了红烧肉，好开心！"},
+        ]
+
+        result = psycho_screen(messages, include_prefix=False)
+
+        print(f"\n[tool] normal result: {result}")
+        assert result["detected"] is False
+        assert result["category"] == "none"
+        assert result["severity"] == "none"
+
+    def test_no_prefix_when_not_detected(self, api_key):
+        """When no issue is detected, prefix should be empty"""
+        messages = [
+            {"role": "user", "content": "今天天气真好呀！"},
+        ]
+
+        result = psycho_screen(messages, include_prefix=True)
+        assert result["detected"] is False
+        assert result.get("prefix", "") == ""
+
+    def test_empty_messages(self, api_key):
+        """Empty message list"""
+        result = psycho_screen([], include_prefix=True)
+        assert result["detected"] is False
+        assert result["summary"] == "无儿童对话内容可分析"
+
+    def test_messages_without_child_content(self, api_key):
+        """Only a system message, no user message"""
+        messages = [
+            {"role": "system", "content": "你是一个友好的小智AI玩偶..."},
+        ]
+
+        result = psycho_screen(messages, include_prefix=True)
+        assert result["detected"] is False
+
+    def test_api_key_missing(self):
+        """Behavior when the API key is not configured"""
+        # Temporarily remove the API key
+        original = os.environ.pop("MINIMAX_API_KEY", None)
+        try:
+            result = psycho_screen(
+                [{"role": "user", "content": "测试"}],
+                include_prefix=True
+            )
+        finally:
+            # Restore the original value, even if the call above raises
+            if original is not None:
+                os.environ["MINIMAX_API_KEY"] = original
+
+        assert result["detected"] is False
+        assert "error" in result