Skip to content

[DomCrawler] Add more details about some methods #21034

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 3 commits into
base: 6.4
Choose a base branch
from

Conversation

javiereguiluz
Copy link
Member

After merging #21031 I thought that we could improve some explanations of some methods with more details and references.

Copy link

@Copilot Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

This PR enriches the DomCrawler documentation by expanding method descriptions with more detail and adding references to related components and resources.

  • Expanded explanations for common Crawler methods to clarify their behavior and usage.
  • Added a link to Symfony’s CSS Selector component documentation.
  • Introduced an external reference for “XPath expression.”

Copy link
Member

@stof stof left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We should probably also make it clear which methods are meant to be called on a single-node Crawler because they ignore other nodes than the first one (any method where the implementation rely on using $this->getNode(0) basically).

We have several places guaranteed to be dealing with single-node crawlers in the API:

  • the callback of each
  • the callback of reduce
  • the return value of eq (and its shortcut) for a valid position (for an invalid position, it will be an empty crawler for which those methods relying on the first node will throw an exception)

``first()``
First node.
Selects the first node (equivalent to ``eq(0)``).
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Using Selects in all those descriptions is confusing to me. We have different kinds of methods in this lists:

  • methods creating a Crawler containing only a subset of the nodes of the current Crawler instance (eq and all its shorcuts, but also reduce which actually filters the current instance and slice)
  • methods returning a new Crawler with a different selection (which also includes the filter and filterXpath methods, despite their name, as they are doing a mix of matching the existing nodes or finding their descendants)

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You are right, but this doc is in testing/dom_crawler (we also have components/dom_crawler) so this is probably an internal detail not relevant in this page. 🤔

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Well, if you improve the documentation of available methods to make it easier to know what they do (without going to another page), it is relevant to make it clear what they do IMO. Otherwise, you could remove the list and make them go to components/dom_crawler to get the proper explanation.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants