#34201 MCP server update Search #34267

Laura-dotCMS · 2026-01-09T16:51:35Z

Proposed Changes

Updated the MCP server to use the new Content Drive API instead of having the LLM craft Lucene queries.

Closes: #34201

Checklist

Tests
Translations
Security Implications Contemplated (add notes if applicable)

Additional Info

Updates create a more structured input/output for the LLM to consume.

Screenshots

Original	Updated
original screenshot	updated screenshot

fmontes · 2026-01-16T22:33:56Z

core-web/apps/mcp-server/src/services/search.ts

+            this.serviceLogger.error('Invalid drive search parameters', validated.error);
            throw new Error(
-                'Invalid search parameters: ' + JSON.stringify(validated.error.format())
+                'Invalid drive search parameters: ' + JSON.stringify(validated.error.format())


Are we sure we want to call this "drive search" or just search or "asset search"?

I called it drive because that is the name of the endpoint we are hitting for the search. I can change it to whatever you'd like.

fmontes · 2026-01-16T22:42:27Z

core-web/apps/mcp-server/src/tools/search/description.ts

+- filters.filterFolders (boolean)
+  If true, excludes folders from results.

-"+Blog.body:("business" && "Apple")"
+- showFolders (boolean)
+  If true, explicitly includes folders in results.


WHAT! If this endpoint works like this, we need to fix it; we can't have two params for including/excluding folders...

That is actually how the endpoint works. Here is a sample search from the interface:

{
    "assetPath": "//BankTech/",
    "includeSystemHost": true,
    "filters": {
        "text": "",
        "filterFolders": true
    },
    "contentTypes": [
        "FileAsset",
        "htmlpageasset"
    ],
    "baseTypes": [
        "FILEASSET",
        "HTMLPAGE"
    ],
    "offset": 0,
    "maxResults": 20,
    "sortBy": "modDate:desc",
    "archived": false,
    "showFolders": false
}
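
If the endpoint keeps both flags, one option is to hide them behind a single switch on the MCP side. The sketch below is only an illustration: `buildDriveSearchRequest`, `DriveSearchOptions`, and `includeFolders` are hypothetical names, the fixed values are copied from the sample above, and it assumes `filters.filterFolders` and `showFolders` are always meant to be opposites, which the endpoint may not guarantee.

```typescript
// Request body shape taken from the sample search above.
interface DriveSearchRequest {
    assetPath: string;
    includeSystemHost: boolean;
    filters: { text: string; filterFolders: boolean };
    contentTypes: string[];
    baseTypes: string[];
    offset: number;
    maxResults: number;
    sortBy: string;
    archived: boolean;
    showFolders: boolean;
}

// Hypothetical options object exposed to the LLM (not part of the PR).
interface DriveSearchOptions {
    assetPath: string;
    contentTypes: string[];
    baseTypes: string[];
    text?: string;
    includeFolders?: boolean; // single switch replacing the two folder flags
    offset?: number;
    maxResults?: number;
}

function buildDriveSearchRequest(options: DriveSearchOptions): DriveSearchRequest {
    const includeFolders = options.includeFolders ?? false;
    return {
        assetPath: options.assetPath,
        includeSystemHost: true,
        filters: {
            text: options.text ?? '',
            // ASSUMPTION: excluding folders is the exact inverse of showing them.
            filterFolders: !includeFolders
        },
        contentTypes: options.contentTypes,
        baseTypes: options.baseTypes,
        offset: options.offset ?? 0,
        maxResults: options.maxResults ?? 20,
        sortBy: 'modDate:desc',
        archived: false,
        showFolders: includeFolders
    };
}
```

Calling it with `assetPath: '//BankTech/'` and the same content and base types reproduces the sample body, so the LLM would only ever see one folder-related parameter.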

fmontes · 2026-01-16T22:43:16Z

core-web/apps/mcp-server/src/tools/search/description.ts

+----------------------------------------

-Use square brackets with TO for a value range. Use the strict date format "yyyyMMddHHmmss" for dates.
+Returns the raw Drive Search API response.


I think it is confusing that we call this "Drive" search... we need to be consistent... asset search or something, and explicitly tell the LLM what an "asset" is.

Whhaaattttt, continuity? Crazy talk.

fmontes · 2026-01-16T22:44:02Z

core-web/apps/mcp-server/src/tools/search/description.ts

+- If results include multiple baseTypes (HTMLPAGE, FILEASSET, CONTENT, etc.),
+  group them by baseType unless the user asks otherwise.


Why the grouping?

Group them for visual recognition. Without grouping, if you ask the LLM to list the assets/info it will throw them out willy-nilly.
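
For reference, the grouping the description asks the LLM to do amounts to something like the sketch below. It is not code from this PR: `groupByBaseType` is a hypothetical helper, and it assumes each result item carries a string `baseType` field.

```typescript
// Generic item shape, mirroring the "record of unknown values" approach.
type DriveSearchItem = Record<string, unknown>;

// Buckets items by their baseType, preserving the order in which types first appear.
function groupByBaseType(items: DriveSearchItem[]): Map<string, DriveSearchItem[]> {
    const groups = new Map<string, DriveSearchItem[]>();
    for (const item of items) {
        const raw = item['baseType'];
        // ASSUMPTION: items without a string baseType go into an "UNKNOWN" bucket.
        const key = typeof raw === 'string' ? raw : 'UNKNOWN';
        const bucket = groups.get(key) ?? [];
        bucket.push(item);
        groups.set(key, bucket);
    }
    return groups;
}
```

Doing this in the server instead of the prompt would make the grouping deterministic, at the cost of no longer returning the raw response untouched.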

fmontes · 2026-01-16T22:45:03Z

core-web/apps/mcp-server/src/tools/search/description.ts

-When writing dotCMS Lucene queries, always use the correct content type and field format, explicit operators, escape special characters, stick to the strict date format, and do not generate raw OpenSearch JSON unless you really need advanced features.
+MENTAL MODEL:
+The MCP server is the source of truth.
+The agent's role is to faithfully expose its results, not reinterpret them.


Are we sure we want this? A human might ask for an interpretation of the results, like "how many contents do I have about XYZ topic" or something like that.

The question you pose will still work, because it parses the text to find the results. Without telling it not to reinterpret them, it hallucinates results or gives responses it 'thinks' you want rather than using the data presented to it.

I was trying to find the example that got me to this instruction, but it is no longer available. Basically I told the search to tell me every file that contained the word "Adobe" in it. The LLM interpreted that to mean I just wanted to know the 'Pages' that had the word "Adobe" and so only returned the list of 5 or so pages, and none of the assets even though it had them in the temp file.
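
To make the "Adobe" case concrete, this is roughly the behavior I expect from a faithful answer: match against every item in the raw response, whatever its base type. The helper below is a hypothetical sketch, not something in this PR, and it only checks top-level string fields of each item.

```typescript
type DriveSearchItem = Record<string, unknown>;

// Returns every item with a top-level string field containing the term
// (case-insensitive). No item is dropped because of its baseType.
function itemsMentioning(items: DriveSearchItem[], term: string): DriveSearchItem[] {
    const needle = term.toLowerCase();
    return items.filter((item) =>
        Object.keys(item).some((key) => {
            const value = item[key];
            return typeof value === 'string' && value.toLowerCase().indexOf(needle) !== -1;
        })
    );
}
```

With a mixed result set, both pages and file assets that mention the term come back, which is the list the LLM skipped when it decided I only wanted pages.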

fmontes · 2026-01-16T22:46:14Z

core-web/apps/mcp-server/src/types/search.ts

+ * Drive Search response schema
+ * Matches the structure returned by /api/v1/drive/search
+ */
+export const DriveSearchItemSchema = z.record(z.string(), z.unknown());


Big no... drive/search returns contentlets, and we had a contentlet schema; don't delete that.

It returns a slightly different schema than the contentlet schema (it differs by about 5 or so fields). I generalized it because the LLM doesn't need an exact mapping to work; it reads off the response JSON. If you want an exact schema returned, I can map the one returned by the Content Drive, or go back to the contentlet schema. The new drive schema does have some nice features, like all the workflows, permissions, and metadata available for that contentlet/asset.
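
A possible middle ground, sketched here in plain TypeScript rather than zod so it stands alone: require the handful of fields we rely on and let everything else pass through. The field names `identifier` and `baseType` are my assumption about what both schemas share, and `isDriveSearchItem` is a hypothetical name. In zod this would probably mean keeping the existing contentlet schema and allowing unknown keys on it.

```typescript
// Known fields are typed; any extra drive-only fields (workflows,
// permissions, metadata, ...) are still allowed through as unknown.
interface DriveSearchItem {
    identifier: string; // ASSUMPTION: present on every returned item
    baseType: string; // ASSUMPTION: present on every returned item
    [extra: string]: unknown;
}

// Type guard that checks only the required fields and ignores the rest.
function isDriveSearchItem(value: unknown): value is DriveSearchItem {
    if (typeof value !== 'object' || value === null || Array.isArray(value)) {
        return false;
    }
    const record = value as Record<string, unknown>;
    return typeof record['identifier'] === 'string' && typeof record['baseType'] === 'string';
}
```

This keeps some validation on the response without having to map every field the Content Drive returns.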

fmontes · 2026-01-16T22:46:53Z

core-web/apps/mcp-server/tsconfig.json

        "strict": true,
-        "esModuleInterop": true
+        "esModuleInterop": true,
+        "importHelpers": false


What does this do?

esModuleInterop lets you import older CommonJS libraries with the newer ES module import syntax, just to make life a little easier.
importHelpers is set to false so you don't need the extra tslib dependency: the helpers stay inline in the compiled files instead of being imported from somewhere else.

I can remove them if you'd rather.

core-web/apps/mcp-server/src/utils/context-store.ts

Laura Cabrerizo and others added 4 commits January 7, 2026 13:44

chore(mcp-server): Added new 'List Folder Contents' functionality tha…

40a7c2e

…t will list the contents of a folder when queried. It will then load the raw json into the context (optional) to be used by the MCP server for further sorting and filtering.

Removed previously added list-folder tool and updated the Search to …

39d3894

…use the new Content Drive API. Updated the LLM search description with new directives.

chore(mcp-server): Update the Search endpoint to use the new content …

c93d1b0

…drive API instead of having the MCP server create lucene queries. REF: #34201

chore(mcp-server): Small updates to remove artifacts from bad PR ref: #…

6ed992a

…34201

dotCMS deleted a comment from github-actions bot Jan 9, 2026

github-actions bot mentioned this pull request Jan 9, 2026

[FEATURE] MCP Server List Folder Contents #34201

Open

4 tasks

Laura-dotCMS and others added 5 commits January 9, 2026 14:48

Fixed Maven build errors

8fd9832

Fixed Maven build errors and tweaked search description

1341493

Updated failing unit tests to use new Content Drive search method.

d94079b

Updated failing unit tests to use new Content Drive search method try 2.

b0cc004

Merge branch 'main' into lcab_mcp_updates

9e0eb7a

fmontes requested changes Jan 16, 2026

Removed artifacts from context-store

671d7b5
