Expand my Community achievements bar.

Enhance your AEM Assets & Boost Your Development: [AEM Gems | June 19, 2024] Improving the Developer Experience with New APIs and Events

Unable to get the html content of a page, which contains data from external api

Avatar

Level 2

Hi,

 

When I am trying to access the html content of a page using jsoup, I am unable to get the content which is referring form api. Can someone help me in getting the content, which is coming from external api, in java.

 

Thanks in advance.

1 Reply

Avatar

Level 8

Hi @himasreep445197 

 

Lets say you have input stream of the page getting from external api, then using below code snippet we will be able access page content, elements & iterate through the elements.

 

InputStream inputStream; //fetch from external api;
String htmlContent = IOUtils.toString(inputStream);
Document document = Jsoup.parse(htmlContent);
Elements imageElements = document.getElementsByTag("img");
Elements metaElements = document.getElementsByTag("meta");
Elements headElement = document.getElementsByTag("head");
Elements scriptElements = document.getElementsByTag("script");

scriptElements.forEach(
scriptElement ->
{
String scriptPath = scriptElement.attr("src");
}
);