{"id":1276,"date":"2024-10-22T10:28:05","date_gmt":"2024-10-22T10:28:05","guid":{"rendered":"http:\/\/localhost:8090\/?page_id=1276"},"modified":"2024-11-07T10:39:16","modified_gmt":"2024-11-07T09:39:16","slug":"main-content-extraction","status":"publish","type":"page","link":"https:\/\/cesy.dsic.upv.es\/es\/main-content-extraction\/","title":{"rendered":"Extracci\u00f3n del contenido principal"},"content":{"rendered":"\t\t<div data-elementor-type=\"wp-page\" data-elementor-id=\"1276\" class=\"elementor elementor-1276\">\n\t\t\t\t<div class=\"elementor-element elementor-element-ca22a01 e-flex e-con-boxed e-con e-parent\" data-id=\"ca22a01\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t\t<div class=\"e-con-inner\">\n\t\t\t\t<div class=\"elementor-element elementor-element-7b84c35 elementor-widget elementor-widget-elementskit-heading\" data-id=\"7b84c35\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"elementskit-heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<div class=\"ekit-wid-con\" ><div class=\"ekit-heading elementskit-section-title-wraper    ekit_heading_tablet-   ekit_heading_mobile-\"><h1 class=\"ekit-heading--title elementskit-section-title text_fill\">Main content extraction<\/h1><div class=\"ekit_heading_separetor_wraper ekit_heading_elementskit-border-divider ekit-dotted\"><div class=\"elementskit-border-divider ekit-dotted\"><\/div><\/div><h3 class=\"ekit-heading--subtitle elementskit-section-subtitle  \">\n\t\t\t\t\t\tExtract the essential information of a webpage.\n\t\t\t\t\t<\/h3><\/div><\/div>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t<div class=\"elementor-element elementor-element-8b29329 e-flex e-con-boxed e-con e-parent\" data-id=\"8b29329\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t\t<div class=\"e-con-inner\">\n\t\t<div class=\"elementor-element elementor-element-43362ff e-con-full e-flex e-con e-child\" data-id=\"43362ff\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t<div class=\"elementor-element elementor-element-463ad59 elementor-widget elementor-widget-heading\" data-id=\"463ad59\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\">Why is it useful?<\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-6c3f00b elementor-widget elementor-widget-text-editor\" data-id=\"6c3f00b\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p class=\"p1\">Extracting information from a website is useful and important for every user. However it&#8217;s not always an easy task. When you browse the web you can find a lot of noisy and useless elements that can be annoying.<\/p><p class=\"p1\">The main content in a webpage contains the relevant content to the user. It is usually composed of text, images, and any other multimedia; and it is typically surrounded or even interrupted by irrelevant information, such as headers, footers, menus, banners, advertisements, etc.<\/p><p class=\"p1\">The main content in a webpage can be useful for:<\/p><ul class=\"ul1\"><li class=\"li1\"><b>Accessibility tools<\/b>, because they can automatically start reading the actual content of the page.<\/li><li class=\"li1\"><b>Other systems and tools, such as indexers or wrappers<\/b>, as a preliminary stage to avoid banners and unnecessary content in later phases of the analysis.<\/li><\/ul><p class=\"p1\">One important advantage of this tool is that it not only extract the main content text from the webpage, but also images, videos, and any other multimedia.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t<div class=\"elementor-element elementor-element-1fc9dce e-con-full e-flex e-con e-child\" data-id=\"1fc9dce\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t<div class=\"elementor-element elementor-element-a38c66f elementor-widget elementor-widget-image\" data-id=\"a38c66f\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img fetchpriority=\"high\" decoding=\"async\" width=\"300\" height=\"300\" src=\"https:\/\/cesy.dsic.upv.es\/wp-content\/uploads\/2024\/10\/text-mining-1476780_640-300x300.webp\" class=\"attachment-medium size-medium wp-image-1309\" alt=\"\" srcset=\"https:\/\/cesy.dsic.upv.es\/wp-content\/uploads\/2024\/10\/text-mining-1476780_640-300x300.webp 300w, https:\/\/cesy.dsic.upv.es\/wp-content\/uploads\/2024\/10\/text-mining-1476780_640-150x150.webp 150w, https:\/\/cesy.dsic.upv.es\/wp-content\/uploads\/2024\/10\/text-mining-1476780_640.webp 640w\" sizes=\"(max-width: 300px) 100vw, 300px\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t<div class=\"elementor-element elementor-element-23195ae e-flex e-con-boxed e-con e-child\" data-id=\"23195ae\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t\t<div class=\"e-con-inner\">\n\t\t\t\t<div class=\"elementor-element elementor-element-9187a7c elementor-widget elementor-widget-heading\" data-id=\"9187a7c\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\">Two kinds of extractors<\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-f392b71 elementor-widget elementor-widget-text-editor\" data-id=\"f392b71\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>A <strong>page-level technique<\/strong> only takes into account the elements, DOM nodes and text of the URL given as input. The main benefit of a page-level tool is that it only needs to load and analyze one single webpage to detect the main content. The speed of the algorithm is increased compared to site-level techniques.<\/p><p>A <strong>site-level technique<\/strong>, on the other hand, goes beyond analyzing a single page. In addition to the given URL, it loads and examines other pages from the same website to identify recurring patterns, which helps in accurately extracting the main content. Although this approach is slower, it enhances the reliability of the extraction by using insights from multiple pages within the same site.<\/p><p>When you use <strong>CESY<\/strong>, you can choose the kind of extractor you prefer.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t<div class=\"elementor-element elementor-element-90b1040 elementor-hidden-desktop elementor-hidden-tablet elementor-hidden-mobile e-flex e-con-boxed e-con e-parent\" data-id=\"90b1040\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t\t<div class=\"e-con-inner\">\n\t\t\t\t<div class=\"elementor-element elementor-element-378b6ca elementor-widget elementor-widget-heading\" data-id=\"378b6ca\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\">FAQ<\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-21373cc elementor-widget elementor-widget-n-accordion\" data-id=\"21373cc\" data-element_type=\"widget\" data-e-type=\"widget\" data-settings=\"{&quot;default_state&quot;:&quot;all_collapsed&quot;,&quot;n_accordion_animation_duration&quot;:{&quot;unit&quot;:&quot;ms&quot;,&quot;size&quot;:200,&quot;sizes&quot;:[]},&quot;max_items_expended&quot;:&quot;one&quot;}\" data-widget_type=\"nested-accordion.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<div class=\"e-n-accordion\" aria-label=\"Accordion. Open links with Enter or Space, close with Escape, and navigate with Arrow Keys\">\n\t\t\t\t\t\t<details id=\"e-n-accordion-item-3480\" class=\"e-n-accordion-item\" >\n\t\t\t\t<summary class=\"e-n-accordion-item-title\" data-accordion-index=\"1\" tabindex=\"0\" aria-expanded=\"false\" aria-controls=\"e-n-accordion-item-3480\" >\n\t\t\t\t\t<span class='e-n-accordion-item-title-header'><div class=\"e-n-accordion-item-title-text\"> What's MEW for? <\/div><\/span>\n\t\t\t\t\t\t\t<span class='e-n-accordion-item-title-icon'>\n\t\t\t<span class='e-opened' ><svg aria-hidden=\"true\" class=\"e-font-icon-svg e-fas-minus\" viewBox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h384c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><\/span>\n\t\t\t<span class='e-closed'><svg aria-hidden=\"true\" class=\"e-font-icon-svg e-fas-plus\" viewBox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H272V64c0-17.67-14.33-32-32-32h-32c-17.67 0-32 14.33-32 32v144H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h144v144c0 17.67 14.33 32 32 32h32c17.67 0 32-14.33 32-32V304h144c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><\/span>\n\t\t<\/span>\n\n\t\t\t\t\t\t<\/summary>\n\t\t\t\t<div role=\"region\" aria-labelledby=\"e-n-accordion-item-3480\" class=\"elementor-element elementor-element-a6643c4 e-con-full e-flex e-con e-child\" data-id=\"a6643c4\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t<div class=\"elementor-element elementor-element-e6c6bf2 elementor-widget elementor-widget-text-editor\" data-id=\"e6c6bf2\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<ul><li><b>Extract Content:<\/b> It automatically extracts the main content of a webpage.<\/li><li><strong>Format Output: <\/strong>It extracts main content in HTML, XML, JSON and plain text format.<\/li><\/ul>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/details>\n\t\t\t\t\t\t<details id=\"e-n-accordion-item-3481\" class=\"e-n-accordion-item\" >\n\t\t\t\t<summary class=\"e-n-accordion-item-title\" data-accordion-index=\"2\" tabindex=\"-1\" aria-expanded=\"false\" aria-controls=\"e-n-accordion-item-3481\" >\n\t\t\t\t\t<span class='e-n-accordion-item-title-header'><div class=\"e-n-accordion-item-title-text\"> How do I use MEW? <\/div><\/span>\n\t\t\t\t\t\t\t<span class='e-n-accordion-item-title-icon'>\n\t\t\t<span class='e-opened' ><svg aria-hidden=\"true\" class=\"e-font-icon-svg e-fas-minus\" viewBox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h384c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><\/span>\n\t\t\t<span class='e-closed'><svg aria-hidden=\"true\" class=\"e-font-icon-svg e-fas-plus\" viewBox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H272V64c0-17.67-14.33-32-32-32h-32c-17.67 0-32 14.33-32 32v144H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h144v144c0 17.67 14.33 32 32 32h32c17.67 0 32-14.33 32-32V304h144c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><\/span>\n\t\t<\/span>\n\n\t\t\t\t\t\t<\/summary>\n\t\t\t\t<div role=\"region\" aria-labelledby=\"e-n-accordion-item-3481\" class=\"elementor-element elementor-element-fba2e95 e-flex e-con-boxed e-con e-child\" data-id=\"fba2e95\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t\t<div class=\"e-con-inner\">\n\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/details>\n\t\t\t\t\t\t<details id=\"e-n-accordion-item-3482\" class=\"e-n-accordion-item\" >\n\t\t\t\t<summary class=\"e-n-accordion-item-title\" data-accordion-index=\"3\" tabindex=\"-1\" aria-expanded=\"false\" aria-controls=\"e-n-accordion-item-3482\" >\n\t\t\t\t\t<span class='e-n-accordion-item-title-header'><div class=\"e-n-accordion-item-title-text\"> Who developed MEW? <\/div><\/span>\n\t\t\t\t\t\t\t<span class='e-n-accordion-item-title-icon'>\n\t\t\t<span class='e-opened' ><svg aria-hidden=\"true\" class=\"e-font-icon-svg e-fas-minus\" viewBox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h384c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><\/span>\n\t\t\t<span class='e-closed'><svg aria-hidden=\"true\" class=\"e-font-icon-svg e-fas-plus\" viewBox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H272V64c0-17.67-14.33-32-32-32h-32c-17.67 0-32 14.33-32 32v144H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h144v144c0 17.67 14.33 32 32 32h32c17.67 0 32-14.33 32-32V304h144c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><\/span>\n\t\t<\/span>\n\n\t\t\t\t\t\t<\/summary>\n\t\t\t\t<div role=\"region\" aria-labelledby=\"e-n-accordion-item-3482\" class=\"elementor-element elementor-element-f333e8f e-flex e-con-boxed e-con e-child\" data-id=\"f333e8f\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t\t<div class=\"e-con-inner\">\n\t\t\t\t<div class=\"elementor-element elementor-element-68cc0f2 elementor-widget elementor-widget-text-editor\" data-id=\"68cc0f2\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>This software has been designed and implemented in the computer science labs of the\u00a0<a title=\"Universitat Politecnica de Valencia\" href=\"http:\/\/www.upv.es\/\">UPV<\/a>.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/details>\n\t\t\t\t\t\t<details id=\"e-n-accordion-item-3483\" class=\"e-n-accordion-item\" >\n\t\t\t\t<summary class=\"e-n-accordion-item-title\" data-accordion-index=\"4\" tabindex=\"-1\" aria-expanded=\"false\" aria-controls=\"e-n-accordion-item-3483\" >\n\t\t\t\t\t<span class='e-n-accordion-item-title-header'><div class=\"e-n-accordion-item-title-text\"> Is it MEW able to work on a synchronous way? <\/div><\/span>\n\t\t\t\t\t\t\t<span class='e-n-accordion-item-title-icon'>\n\t\t\t<span class='e-opened' ><svg aria-hidden=\"true\" class=\"e-font-icon-svg e-fas-minus\" viewBox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h384c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><\/span>\n\t\t\t<span class='e-closed'><svg aria-hidden=\"true\" class=\"e-font-icon-svg e-fas-plus\" viewBox=\"0 0 448 512\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\"><path d=\"M416 208H272V64c0-17.67-14.33-32-32-32h-32c-17.67 0-32 14.33-32 32v144H32c-17.67 0-32 14.33-32 32v32c0 17.67 14.33 32 32 32h144v144c0 17.67 14.33 32 32 32h32c17.67 0 32-14.33 32-32V304h144c17.67 0 32-14.33 32-32v-32c0-17.67-14.33-32-32-32z\"><\/path><\/svg><\/span>\n\t\t<\/span>\n\n\t\t\t\t\t\t<\/summary>\n\t\t\t\t<div role=\"region\" aria-labelledby=\"e-n-accordion-item-3483\" class=\"elementor-element elementor-element-4803c3e e-flex e-con-boxed e-con e-child\" data-id=\"4803c3e\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t\t<div class=\"e-con-inner\">\n\t\t\t\t<div class=\"elementor-element elementor-element-3d0f48b elementor-widget elementor-widget-text-editor\" data-id=\"3d0f48b\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Yes, MEW is able to work on a synchronous and asynchronous way.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/details>\n\t\t\t\t\t<\/div>\n\t\t\t\t\t<script type=\"application\/ld+json\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@type\":\"FAQPage\",\"mainEntity\":[{\"@type\":\"Question\",\"name\":\"What's MEW for?\",\"acceptedAnswer\":{\"@type\":\"Answer\",\"text\":\"Extract Content: It automatically extracts the main content of a webpage.Format Output: It extracts main content in HTML, XML, JSON and plain text format.\"}},{\"@type\":\"Question\",\"name\":\"How do I use MEW?\",\"acceptedAnswer\":{\"@type\":\"Answer\",\"text\":\"\"}},{\"@type\":\"Question\",\"name\":\"Who developed MEW?\",\"acceptedAnswer\":{\"@type\":\"Answer\",\"text\":\"This software has been designed and implemented in the computer science labs of the\\u00a0UPV.\"}},{\"@type\":\"Question\",\"name\":\"Is it MEW able to work on a synchronous way?\",\"acceptedAnswer\":{\"@type\":\"Answer\",\"text\":\"Yes, MEW is able to work on a synchronous and asynchronous way.\"}}]}<\/script>\n\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t<div class=\"elementor-element elementor-element-a8bbecf e-flex e-con-boxed e-con e-parent\" data-id=\"a8bbecf\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t\t<div class=\"e-con-inner\">\n\t\t\t\t<div class=\"elementor-element elementor-element-a6ac892 elementor-widget elementor-widget-heading\" data-id=\"a6ac892\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\">Examples<\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-42351c2 elementor-widget elementor-widget-text-editor\" data-id=\"42351c2\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<div class=\"elementor-element elementor-element-69d4e8b elementor-widget elementor-widget-text-editor\" data-id=\"69d4e8b\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\"><div class=\"elementor-widget-container\"><p>Drag the slider to see the webpage before and after extracting its main content. All other elements are hidden away.<\/p><\/div><\/div>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t<div class=\"elementor-element elementor-element-3101e50 e-con-full jxp-example-grid e-flex e-con e-child\" data-id=\"3101e50\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t<div class=\"elementor-element elementor-element-1aa0226 e-con-full e-flex e-con e-child\" data-id=\"1aa0226\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t<div class=\"elementor-element elementor-element-6cf2633 elementor-widget elementor-widget-text-editor\" data-id=\"6cf2633\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>1. New York University&#8217;s History\u00a0 (<a href=\"https:\/\/www.nyu.edu\/about\/news-publications\/history-of-nyu.html\" data-wplink-edit=\"true\">original<\/a>)<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-83a43a8 elementor-widget elementor-widget-elementskit-image-comparison\" data-id=\"83a43a8\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"elementskit-image-comparison.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<div class=\"ekit-wid-con\" >\n\t\t<div class=\"elementskit-image-comparison image-comparison-container\" data-offset=\"0.5\" data-overlay=\"\" data-label_after=\"After\" data-label_before=\"Before\" data-move_slider_on_hover=\"\" data-click_to_move=\"\">\n\t\t\t<img decoding=\"async\" src=\"https:\/\/cesy.dsic.upv.es\/wp-content\/uploads\/2024\/10\/ex1.png\" title=\"ex1\" alt=\"ex1\" loading=\"lazy\" \/><img decoding=\"async\" src=\"https:\/\/cesy.dsic.upv.es\/wp-content\/uploads\/2024\/10\/ex1.2.png\" title=\"ex1.2\" alt=\"ex1.2\" loading=\"lazy\" \/>\t\t<\/div>\n\n\t<\/div>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t<div class=\"elementor-element elementor-element-7ea9361 e-con-full e-flex e-con e-child\" data-id=\"7ea9361\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t<div class=\"elementor-element elementor-element-ba8dcd9 elementor-widget elementor-widget-text-editor\" data-id=\"ba8dcd9\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>2. United Nations, News &amp; Media, French (<a href=\"https:\/\/www.un.org\/fr\/our-work\">original<\/a>)<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-2f6d8ab elementor-widget elementor-widget-elementskit-image-comparison\" data-id=\"2f6d8ab\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"elementskit-image-comparison.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<div class=\"ekit-wid-con\" >\n\t\t<div class=\"elementskit-image-comparison image-comparison-container\" data-offset=\"0.5\" data-overlay=\"\" data-label_after=\"After\" data-label_before=\"Before\" data-move_slider_on_hover=\"\" data-click_to_move=\"\">\n\t\t\t<img decoding=\"async\" src=\"https:\/\/cesy.dsic.upv.es\/wp-content\/uploads\/2024\/11\/nationsunites2.png\" title=\"nationsunites2\" alt=\"nationsunites2\" loading=\"lazy\" \/><img decoding=\"async\" src=\"https:\/\/cesy.dsic.upv.es\/wp-content\/uploads\/2024\/11\/nationsunites1.png\" title=\"nationsunites1\" alt=\"nationsunites1\" loading=\"lazy\" \/>\t\t<\/div>\n\n\t<\/div>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t<div class=\"elementor-element elementor-element-4be1b1a e-con-full e-flex e-con e-child\" data-id=\"4be1b1a\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t<div class=\"elementor-element elementor-element-3b74a41 elementor-widget elementor-widget-text-editor\" data-id=\"3b74a41\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>3. Linux Mint Partners Page (<a href=\"https:\/\/linuxmint.com\/partners.php\">original<\/a>)<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-c942edf elementor-widget elementor-widget-elementskit-image-comparison\" data-id=\"c942edf\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"elementskit-image-comparison.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<div class=\"ekit-wid-con\" >\n\t\t<div class=\"elementskit-image-comparison image-comparison-container\" data-offset=\"0.5\" data-overlay=\"\" data-label_after=\"After\" data-label_before=\"Before\" data-move_slider_on_hover=\"\" data-click_to_move=\"\">\n\t\t\t<img decoding=\"async\" src=\"https:\/\/cesy.dsic.upv.es\/wp-content\/uploads\/2024\/11\/linux2.png\" title=\"linux2\" alt=\"linux2\" loading=\"lazy\" \/><img decoding=\"async\" src=\"https:\/\/cesy.dsic.upv.es\/wp-content\/uploads\/2024\/11\/linux1.png\" title=\"linux1\" alt=\"linux1\" loading=\"lazy\" \/>\t\t<\/div>\n\n\t<\/div>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t<div class=\"elementor-element elementor-element-80cb46b e-con-full e-flex e-con e-child\" data-id=\"80cb46b\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t<div class=\"elementor-element elementor-element-bb79a7d elementor-widget elementor-widget-text-editor\" data-id=\"bb79a7d\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>4. Industry Congress, Digital Twins news (<a href=\"https:\/\/congresoindustria.gob.es\/congreso-nacional-de-industria\/digital-twin-para-la-industria-del-agua\/?_gl=1*v4a6ot*_up*MQ..*_ga*MTQ1MDI1NjQwOC4xNzMwNzIwMzgw*_ga_0LEFE7ZFQ9*MTczMDcyMDM3OS4xLjAuMTczMDcyMDM3OS4wLjAuMA..\">original<\/a>)<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-db0eb9d elementor-widget elementor-widget-elementskit-image-comparison\" data-id=\"db0eb9d\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"elementskit-image-comparison.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<div class=\"ekit-wid-con\" >\n\t\t<div class=\"elementskit-image-comparison image-comparison-container\" data-offset=\"0.5\" data-overlay=\"\" data-label_after=\"After\" data-label_before=\"Before\" data-move_slider_on_hover=\"\" data-click_to_move=\"\">\n\t\t\t<img decoding=\"async\" src=\"https:\/\/cesy.dsic.upv.es\/wp-content\/uploads\/2024\/11\/digitaltwins2.png\" title=\"digitaltwins2\" alt=\"digitaltwins2\" loading=\"lazy\" \/><img decoding=\"async\" src=\"https:\/\/cesy.dsic.upv.es\/wp-content\/uploads\/2024\/11\/digitaltwins1.png\" title=\"digitaltwins1\" alt=\"digitaltwins1\" loading=\"lazy\" \/>\t\t<\/div>\n\n\t<\/div>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t<div class=\"elementor-element elementor-element-4c36c75d e-flex e-con-boxed e-con e-parent\" data-id=\"4c36c75d\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t\t<div class=\"e-con-inner\">\n\t\t\t\t<div class=\"elementor-element elementor-element-2273a064 elementor-widget elementor-widget-heading\" data-id=\"2273a064\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\">Want to try it yourself?<\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-9b9e4ac elementor-widget elementor-widget-button\" data-id=\"9b9e4ac\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"button.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<div class=\"elementor-button-wrapper\">\n\t\t\t\t\t<a class=\"elementor-button elementor-button-link elementor-size-sm\" href=\"\/contact\">\n\t\t\t\t\t\t<span class=\"elementor-button-content-wrapper\">\n\t\t\t\t\t\t\t\t\t<span class=\"elementor-button-text\">Request a demo<\/span>\n\t\t\t\t\t<\/span>\n\t\t\t\t\t<\/a>\n\t\t\t\t<\/div>\n\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t","protected":false},"excerpt":{"rendered":"<p>Main content extraction Extract the essential information of a webpage. Why is it useful? Extracting information from a website is useful and important for every user. However it&#8217;s not always an easy task. When you browse the web you can find a lot of noisy and useless elements that can be annoying. The main content [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"_uag_custom_page_level_css":"","site-sidebar-layout":"no-sidebar","site-content-layout":"page-builder","ast-site-content-layout":"full-width-container","site-content-style":"default","site-sidebar-style":"default","ast-global-header-display":"","ast-banner-title-visibility":"","ast-main-header-display":"","ast-hfb-above-header-display":"","ast-hfb-below-header-display":"","ast-hfb-mobile-header-display":"","site-post-title":"disabled","ast-breadcrumbs-content":"","ast-featured-img":"disabled","footer-sml-layout":"","ast-disable-related-posts":"","theme-transparent-header-meta":"","adv-header-id-meta":"","stick-header-meta":"","header-above-stick-meta":"","header-main-stick-meta":"","header-below-stick-meta":"","astra-migrate-meta-layouts":"set","ast-page-background-enabled":"default","ast-page-background-meta":{"desktop":{"background-color":"var(--ast-global-color-4)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"ast-content-background-meta":{"desktop":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"footnotes":""},"class_list":["post-1276","page","type-page","status-publish","hentry"],"uagb_featured_image_src":{"full":false,"thumbnail":false,"medium":false,"medium_large":false,"large":false,"1536x1536":false,"2048x2048":false,"trp-custom-language-flag":false},"uagb_author_info":{"display_name":"cmarabe1","author_link":"https:\/\/cesy.dsic.upv.es\/es\/author\/cmarabe1\/"},"uagb_comment_info":0,"uagb_excerpt":"Main content extraction Extract the essential information of a webpage. Why is it useful? Extracting information from a website is useful and important for every user. However it&#8217;s not always an easy task. When you browse the web you can find a lot of noisy and useless elements that can be annoying. The main content&hellip;","_links":{"self":[{"href":"https:\/\/cesy.dsic.upv.es\/es\/wp-json\/wp\/v2\/pages\/1276","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/cesy.dsic.upv.es\/es\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/cesy.dsic.upv.es\/es\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/cesy.dsic.upv.es\/es\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/cesy.dsic.upv.es\/es\/wp-json\/wp\/v2\/comments?post=1276"}],"version-history":[{"count":280,"href":"https:\/\/cesy.dsic.upv.es\/es\/wp-json\/wp\/v2\/pages\/1276\/revisions"}],"predecessor-version":[{"id":3007,"href":"https:\/\/cesy.dsic.upv.es\/es\/wp-json\/wp\/v2\/pages\/1276\/revisions\/3007"}],"wp:attachment":[{"href":"https:\/\/cesy.dsic.upv.es\/es\/wp-json\/wp\/v2\/media?parent=1276"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}