There is any way to extract the content of a HTML page that starts from <body> and ends with </body> in php. If there can anyone post some sample code.
Extract a content of a html page in php
8.7k Views Asked by bharathi At
3
There are 3 best solutions below
0
On
You should have a look at the DOMDocument reference.
This example reads a html document, creates a DOMDocument and gets the body tag:
libxml_use_internal_errors(true);
$dom = new DOMDocument;
$dom->loadHTMLFile('http://example.com');
libxml_use_internal_errors(false);
$body = $dom->getElementsByTagName('body')->item(0);
echo $body->textContent; // print all the text content in the body
You should also check out the following resources:
0
On
You can also try to use non-DOM solution based on strpos function:
$html = file_get_contents($url);
$html = substr($html,stripos($html,'<body>')+6);
$html = substr($html,0,strripos($html,'</body>'));
stripos is case insensitive version of strpos, strripos is case insensitive 'rightmost position' version of strpos.
Hope that it will help you!
Try PHP Simple HTML DOM Parser