[RFC] Tests using the Everything server #582
Draft
Add Playwright e2e tests which connect to the reference Everything server.
This implementation is in forks:
- `structuredContent` tool being tested here

👋 @cliffhall @olaservo Your comments here seem to be discussing a similar idea.
👋 @jerome3o-anthropic In your MCP Dev conf talk, you mentioned working on example full-featured servers. This is a tiny contribution, but I thought you might be interested.
Motivation and Context
The Inspector lacks automated testing against real MCP servers, making it difficult to catch regressions and validate new functionality. (Much of the UI depends on a connected server.)
The Everything server provides example implementations of many MCP protocol features.
Together, these create an opportunity for a feedback loop to drive MCP specification validation and compliance:
I think there is a lot of potential in this direction.
I'm particularly interested in validation and compliance of MCP clients and servers, and in how I can help. I'm aware of the focus on validation in the Roadmap and of the SDK compliance spec schema.
How Has This Been Tested?
Running in GitHub Actions, sample run
Request for Comments
Seeking feedback on the concept, but specifically:
Current scope
- `structuredContent` (implements the new MCP 2025-06-18 specification feature)
Current limitations
- `structuredContent` tool (can't `npx ...` from a sub-package branch)
Next Steps
- `npx @modelcontextprotocol/server-everything` (once the `structuredContent` tool is merged; otherwise, change the tested tool)
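
To make the `structuredContent` scope above more concrete, here is a minimal sketch of the kind of check an e2e test might apply to a tool result. It is not code from this PR. It assumes the 2025-06-18 behavior that a tool result can carry a `structuredContent` object and that, for backwards compatibility, a text content block should carry the same data serialized as JSON. The names `checkStructuredResult`, `deepEqual`, `ToolResult`, and the sample payload are hypothetical and only model the fields used here.

```typescript
// Hypothetical helper for illustration; not part of the Inspector or the Everything server.

// Minimal shapes for a tools/call result; only the fields used below are modeled.
interface ContentBlock {
  type: string;
  text?: string;
}

interface ToolResult {
  content: ContentBlock[];
  structuredContent?: Record<string, unknown>;
}

// Structural equality for JSON-like values (objects, arrays, primitives).
function deepEqual(a: unknown, b: unknown): boolean {
  if (a === b) return true;
  if (typeof a !== "object" || typeof b !== "object" || a === null || b === null) {
    return false;
  }
  if (Array.isArray(a) !== Array.isArray(b)) return false;
  const left = a as Record<string, unknown>;
  const right = b as Record<string, unknown>;
  const keys = Object.keys(left);
  if (keys.length !== Object.keys(right).length) return false;
  return keys.every((k) => k in right && deepEqual(left[k], right[k]));
}

// Returns a list of problems; an empty list means the result looks consistent.
function checkStructuredResult(result: ToolResult, requiredKeys: string[]): string[] {
  const structured = result.structuredContent;
  if (structured === undefined) return ["structuredContent is missing"];

  const problems: string[] = [];
  for (const key of requiredKeys) {
    if (!(key in structured)) problems.push(`missing key: ${key}`);
  }

  // Backwards-compatibility check: some text block should carry the same data as JSON.
  const matchesText = result.content.some((block) => {
    if (block.type !== "text" || block.text === undefined) return false;
    try {
      return deepEqual(JSON.parse(block.text), structured);
    } catch {
      return false; // not JSON, so it cannot mirror structuredContent
    }
  });
  if (!matchesText) problems.push("no text block mirrors structuredContent");

  return problems;
}

// Usage with a made-up result (the payload is an assumption, not the Everything server's output).
const sample: ToolResult = {
  content: [{ type: "text", text: '{"temperature":22.5,"conditions":"Partly cloudy"}' }],
  structuredContent: { temperature: 22.5, conditions: "Partly cloudy" },
};
console.log(checkStructuredResult(sample, ["temperature", "conditions"])); // prints []
```

A check like this only compares key presence and the text mirror; validating against the tool's declared output schema would need a JSON Schema validator, which is left out to keep the sketch dependency-free.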
Future Test Coverage
Elicitation, Roots, Change Notifications in various contexts
Background
I'm curious about plans for the Inspector. I'd like to see it grow not only for debugging, but also learning and teaching MCP.
A few ideas:
I switched from Claude.ai to Desktop last winter to use reference MCP servers (filesystem & git) to eliminate copy/paste from Claude to vim. This was a huge boost. I read the MCP spec and started writing my own MCP servers, which included tools and dynamic resources. Watching Claude use my own tools got me super fired up about MCP and building. 😄
I was surprised that Claude doesn't automatically use Resources and isn't aware of server-level instructions. I found the client feature matrix, which shows that most clients, notably Claude, lack discoverability and various other aspects of MCP. Since I exclusively use Claude, I'd like to see it reach complete compliance. Since Anthropic initiated MCP, I think it would be great if its client(s) were leading in this area.