{"id":5977,"date":"2025-03-17T11:03:45","date_gmt":"2025-03-17T11:03:45","guid":{"rendered":"https:\/\/aborrego.inscastellbisbal.net\/?p=5977"},"modified":"2025-03-25T12:24:10","modified_gmt":"2025-03-25T12:24:10","slug":"recollir-informacio-via-web-scraping","status":"publish","type":"post","link":"https:\/\/aborrego.inscastellbisbal.net\/en\/2025\/03\/17\/recollir-informacio-via-web-scraping\/","title":{"rendered":"Recollir informaci\u00f3 via web scraping"},"content":{"rendered":"<div data-elementor-type=\"wp-post\" data-elementor-id=\"5977\" class=\"elementor elementor-5977\">\n\t\t\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-beddf63 elementor-section-height-min-height elementor-section-items-top elementor-section-boxed elementor-section-height-default wpr-particle-no wpr-jarallax-no wpr-parallax-no wpr-sticky-section-no wpr-equal-height-no elementor-invisible\" data-id=\"beddf63\" data-element_type=\"section\" data-e-type=\"section\" data-settings=\"{&quot;background_background&quot;:&quot;classic&quot;,&quot;animation&quot;:&quot;fadeIn&quot;,&quot;shape_divider_top&quot;:&quot;waves&quot;,&quot;shape_divider_top_negative&quot;:&quot;yes&quot;}\">\n\t\t\t\t\t<div class=\"elementor-shape elementor-shape-top\" aria-hidden=\"true\" data-negative=\"true\">\n\t\t\t<svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" viewbox=\"0 0 1000 100\" preserveaspectratio=\"none\">\n\t<path class=\"elementor-shape-fill\" d=\"M790.5,93.1c-59.3-5.3-116.8-18-192.6-50c-29.6-12.7-76.9-31-100.5-35.9c-23.6-4.9-52.6-7.8-75.5-5.3\tc-10.2,1.1-22.6,1.4-50.1,7.4c-27.2,6.3-58.2,16.6-79.4,24.7c-41.3,15.9-94.9,21.9-134,22.6C72,58.2,0,25.8,0,25.8V100h1000V65.3\tc0,0-51.5,19.4-106.2,25.7C839.5,97,814.1,95.2,790.5,93.1z\"\/>\n<\/svg>\t\t<\/div>\n\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-50 elementor-top-column elementor-element elementor-element-28432e3\" data-id=\"28432e3\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-8b9beef elementor-invisible elementor-widget elementor-widget-heading\" data-id=\"8b9beef\" data-element_type=\"widget\" data-e-type=\"widget\" data-settings=\"{&quot;_animation&quot;:&quot;fadeInUp&quot;,&quot;_animation_delay&quot;:500}\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h1 class=\"elementor-heading-title elementor-size-default\">RECOLLIR INFORMACI\u00d3 VIA WEB SCRAPING<\/h1>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-42c6e5f elementor-widget elementor-widget-spacer\" data-id=\"42c6e5f\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"spacer.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<div class=\"elementor-spacer\">\n\t\t\t<div class=\"elementor-spacer-inner\"><\/div>\n\t\t<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-406156d elementor-widget elementor-widget-spacer\" data-id=\"406156d\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"spacer.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<div class=\"elementor-spacer\">\n\t\t\t<div class=\"elementor-spacer-inner\"><\/div>\n\t\t<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t<div class=\"elementor-column elementor-col-50 elementor-top-column elementor-element elementor-element-026a130\" data-id=\"026a130\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-53df6bd elementor-align-left elementor-widget elementor-widget-button\" data-id=\"53df6bd\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"button.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<div class=\"elementor-button-wrapper\">\n\t\t\t\t\t<a class=\"elementor-button elementor-button-link elementor-size-sm elementor-animation-float\" href=\"https:\/\/aborrego.inscastellbisbal.net\/en\/repte-1-6-talent-fp\/\">\n\t\t\t\t\t\t<span class=\"elementor-button-content-wrapper\">\n\t\t\t\t\t\t\t\t\t<span class=\"elementor-button-text\">Tornar al repte 1.6<\/span>\n\t\t\t\t\t<\/span>\n\t\t\t\t\t<\/a>\n\t\t\t\t<\/div>\n\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t<div class=\"elementor-element elementor-element-cd695cb e-flex e-con-boxed wpr-particle-no wpr-jarallax-no wpr-parallax-no wpr-sticky-section-no wpr-equal-height-no e-con e-parent\" data-id=\"cd695cb\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t\t<div class=\"e-con-inner\">\n\t\t\t\t<div class=\"elementor-element elementor-element-40809a9 elementor-widget elementor-widget-spacer\" data-id=\"40809a9\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"spacer.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<div class=\"elementor-spacer\">\n\t\t\t<div class=\"elementor-spacer-inner\"><\/div>\n\t\t<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-22e7d33 elementor-widget elementor-widget-text-editor\" data-id=\"22e7d33\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Primer, vaig haver d\u2019acabar d\u2019afegir tota la informaci\u00f3 al codi fins que vaig aconseguir que les respostes del xatbot fossin del meu gust. Despr\u00e9s, vaig ajustar algunes altres configuracions. Com que vaig voler que el meu xatbot simul\u00e9s que era jo, vaig canviar el seu nom a <em data-start=\"335\" data-end=\"340\">Ana<\/em> i vaig fer altres modificacions a les configuracions.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-858267c elementor-widget elementor-widget-image\" data-id=\"858267c\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img fetchpriority=\"high\" decoding=\"async\" width=\"700\" height=\"103\" src=\"https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_303-1024x150.png\" class=\"attachment-large size-large wp-image-5980\" alt=\"\" srcset=\"https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_303-1024x150.png 1024w, https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_303-300x44.png 300w, https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_303-768x113.png 768w, https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_303-1536x225.png 1536w, https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_303.png 1745w\" sizes=\"(max-width: 700px) 100vw, 700px\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-60a93be elementor-widget elementor-widget-image\" data-id=\"60a93be\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img decoding=\"async\" width=\"700\" height=\"120\" src=\"https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_291-1024x176.png\" class=\"attachment-large size-large wp-image-5982\" alt=\"\" srcset=\"https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_291-1024x176.png 1024w, https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_291-300x52.png 300w, https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_291-768x132.png 768w, https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_291-1536x264.png 1536w, https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_291.png 1825w\" sizes=\"(max-width: 700px) 100vw, 700px\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t<div class=\"elementor-element elementor-element-d7a114e e-flex e-con-boxed wpr-particle-no wpr-jarallax-no wpr-parallax-no wpr-sticky-section-no wpr-equal-height-no e-con e-parent\" data-id=\"d7a114e\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t\t<div class=\"e-con-inner\">\n\t\t<div class=\"elementor-element elementor-element-126a34f e-con-full e-flex wpr-particle-no wpr-jarallax-no wpr-parallax-no wpr-sticky-section-no wpr-equal-height-no e-con e-child\" data-id=\"126a34f\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t<div class=\"elementor-element elementor-element-4a8d725 elementor-widget elementor-widget-text-editor\" data-id=\"4a8d725\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Un cop fet aix\u00f2, vaig preguntar a ChatGPT com fer <em data-start=\"446\" data-end=\"460\">web scraping<\/em>, adjuntant-li tot el codi que tenia fins aquell moment. <\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-4480029 elementor-widget elementor-widget-image\" data-id=\"4480029\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img decoding=\"async\" width=\"556\" height=\"238\" src=\"https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_286.png\" class=\"attachment-large size-large wp-image-5983\" alt=\"\" srcset=\"https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_286.png 556w, https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_286-300x128.png 300w\" sizes=\"(max-width: 556px) 100vw, 556px\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t<div class=\"elementor-element elementor-element-a4b70ee e-con-full e-flex wpr-particle-no wpr-jarallax-no wpr-parallax-no wpr-sticky-section-no wpr-equal-height-no e-con e-child\" data-id=\"a4b70ee\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t<div class=\"elementor-element elementor-element-84243fc elementor-widget__width-initial elementor-widget elementor-widget-text-editor\" data-id=\"84243fc\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Com que la meva p\u00e0gina \u00e9s HTML est\u00e0tic, vaig haver d\u2019instal\u00b7lar la llibreria <em data-start=\"594\" data-end=\"609\">BeautifulSoup<\/em> per poder extreure informaci\u00f3. <\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-235e74b elementor-widget elementor-widget-image\" data-id=\"235e74b\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img loading=\"lazy\" decoding=\"async\" width=\"700\" height=\"89\" src=\"https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_288.png\" class=\"attachment-large size-large wp-image-5984\" alt=\"\" srcset=\"https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_288.png 748w, https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_288-300x38.png 300w\" sizes=\"(max-width: 700px) 100vw, 700px\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t<div class=\"elementor-element elementor-element-0f34276 e-con-full e-flex wpr-particle-no wpr-jarallax-no wpr-parallax-no wpr-sticky-section-no wpr-equal-height-no e-con e-child\" data-id=\"0f34276\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t<div class=\"elementor-element elementor-element-9da53cb elementor-widget elementor-widget-text-editor\" data-id=\"9da53cb\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>\u00a0Un cop instal\u00b7lada, vaig verificar que tot estigu\u00e9s correctament configurat.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-8e7c74b elementor-widget elementor-widget-image\" data-id=\"8e7c74b\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img loading=\"lazy\" decoding=\"async\" width=\"490\" height=\"137\" src=\"https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_304.png\" class=\"attachment-large size-large wp-image-5993\" alt=\"\" srcset=\"https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_304.png 490w, https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_304-300x84.png 300w\" sizes=\"(max-width: 490px) 100vw, 490px\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t<div class=\"elementor-element elementor-element-3f633b0 e-flex e-con-boxed wpr-particle-no wpr-jarallax-no wpr-parallax-no wpr-sticky-section-no wpr-equal-height-no e-con e-parent\" data-id=\"3f633b0\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t\t<div class=\"e-con-inner\">\n\t\t\t\t<div class=\"elementor-element elementor-element-ae0d0b4 elementor-widget elementor-widget-image\" data-id=\"ae0d0b4\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img loading=\"lazy\" decoding=\"async\" width=\"700\" height=\"197\" src=\"https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_287-1024x288.png\" class=\"attachment-large size-large wp-image-5992\" alt=\"\" srcset=\"https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_287-1024x288.png 1024w, https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_287-300x84.png 300w, https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_287-768x216.png 768w, https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_287.png 1191w\" sizes=\"(max-width: 700px) 100vw, 700px\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t<div class=\"elementor-element elementor-element-437f498 e-flex e-con-boxed wpr-particle-no wpr-jarallax-no wpr-parallax-no wpr-sticky-section-no wpr-equal-height-no e-con e-parent\" data-id=\"437f498\" data-element_type=\"container\" data-e-type=\"container\">\n\t\t\t\t\t<div class=\"e-con-inner\">\n\t\t\t\t<div class=\"elementor-element elementor-element-b9ed710 elementor-widget elementor-widget-text-editor\" data-id=\"b9ed710\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Despr\u00e9s, vaig afegir el codi que ChatGPT em va proporcionar, situant-lo abans del bucle del xat. Aquest codi permetia extreure el text de la p\u00e0gina, obtenir tots els enlla\u00e7os i veure els encap\u00e7alaments.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-9111e9b elementor-widget__width-initial elementor-widget elementor-widget-image\" data-id=\"9111e9b\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img loading=\"lazy\" decoding=\"async\" width=\"700\" height=\"596\" src=\"https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_290.png\" class=\"attachment-large size-large wp-image-5997\" alt=\"\" srcset=\"https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_290.png 813w, https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_290-300x255.png 300w, https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_290-768x654.png 768w\" sizes=\"(max-width: 700px) 100vw, 700px\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-57db843 elementor-widget__width-initial elementor-widget elementor-widget-image\" data-id=\"57db843\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<img loading=\"lazy\" decoding=\"async\" width=\"700\" height=\"136\" src=\"https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_293-1024x199.png\" class=\"attachment-large size-large wp-image-5999\" alt=\"\" srcset=\"https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_293-1024x199.png 1024w, https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_293-300x58.png 300w, https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_293-768x150.png 768w, https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_293-1536x299.png 1536w, https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_293.png 1803w\" sizes=\"(max-width: 700px) 100vw, 700px\" \/>\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-72ec416 elementor-widget elementor-widget-spacer\" data-id=\"72ec416\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"spacer.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<div class=\"elementor-spacer\">\n\t\t\t<div class=\"elementor-spacer-inner\"><\/div>\n\t\t<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>","protected":false},"excerpt":{"rendered":"<p>RECOLLIR INFORMACI\u00d3 VIA WEB SCRAPING Tornar al repte 1.6 Primer, vaig haver d\u2019acabar d\u2019afegir tota la informaci\u00f3 al codi fins que vaig aconseguir que les respostes del xatbot fossin del meu gust. Despr\u00e9s, vaig ajustar algunes altres configuracions. Com que vaig voler que el meu xatbot simul\u00e9s que era jo, vaig canviar el seu nom a Ana i vaig fer altres modificacions a les configuracions. Un cop fet aix\u00f2, vaig preguntar a ChatGPT com fer web scraping, adjuntant-li tot el codi que tenia fins aquell moment. Com que la meva p\u00e0gina \u00e9s HTML est\u00e0tic, vaig haver d\u2019instal\u00b7lar la llibreria BeautifulSoup per poder extreure informaci\u00f3. \u00a0Un cop instal\u00b7lada, vaig verificar que tot estigu\u00e9s correctament configurat. Despr\u00e9s, vaig afegir el codi que ChatGPT em va proporcionar, situant-lo abans del bucle del xat. Aquest codi permetia extreure el text de la p\u00e0gina, obtenir tots els enlla\u00e7os i veure els encap\u00e7alaments.<\/p>","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[95,84],"tags":[],"class_list":["post-5977","post","type-post","status-publish","format-standard","hentry","category-repte-1-6","category-disseny-i-aplicacio-de-la-ia"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Recollir informaci\u00f3 via web scraping - Ana Borrego Toledo<\/title>\n<meta name=\"robots\" content=\"noindex, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Recollir informaci\u00f3 via web scraping - Ana Borrego Toledo\" \/>\n<meta property=\"og:description\" content=\"RECOLLIR INFORMACI\u00d3 VIA WEB SCRAPING Tornar al repte 1.6 Primer, vaig haver d\u2019acabar d\u2019afegir tota la informaci\u00f3 al codi fins que vaig aconseguir que les respostes del xatbot fossin del meu gust. Despr\u00e9s, vaig ajustar algunes altres configuracions. Com que vaig voler que el meu xatbot simul\u00e9s que era jo, vaig canviar el seu nom a Ana i vaig fer altres modificacions a les configuracions. Un cop fet aix\u00f2, vaig preguntar a ChatGPT com fer web scraping, adjuntant-li tot el codi que tenia fins aquell moment. Com que la meva p\u00e0gina \u00e9s HTML est\u00e0tic, vaig haver d\u2019instal\u00b7lar la llibreria BeautifulSoup per poder extreure informaci\u00f3. \u00a0Un cop instal\u00b7lada, vaig verificar que tot estigu\u00e9s correctament configurat. Despr\u00e9s, vaig afegir el codi que ChatGPT em va proporcionar, situant-lo abans del bucle del xat. Aquest codi permetia extreure el text de la p\u00e0gina, obtenir tots els enlla\u00e7os i veure els encap\u00e7alaments.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/aborrego.inscastellbisbal.net\/en\/2025\/03\/17\/recollir-informacio-via-web-scraping\/\" \/>\n<meta property=\"og:site_name\" content=\"Ana Borrego Toledo\" \/>\n<meta property=\"article:published_time\" content=\"2025-03-17T11:03:45+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-03-25T12:24:10+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_303.png\" \/>\n\t<meta property=\"og:image:width\" content=\"1745\" \/>\n\t<meta property=\"og:image:height\" content=\"256\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"aborrego\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"aborrego\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"3 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/aborrego.inscastellbisbal.net\\\/2025\\\/03\\\/17\\\/recollir-informacio-via-web-scraping\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/aborrego.inscastellbisbal.net\\\/2025\\\/03\\\/17\\\/recollir-informacio-via-web-scraping\\\/\"},\"author\":{\"name\":\"aborrego\",\"@id\":\"https:\\\/\\\/aborrego.inscastellbisbal.net\\\/#\\\/schema\\\/person\\\/596b5d3dc4735c43cfaafed47741b9d1\"},\"headline\":\"Recollir informaci\u00f3 via web scraping\",\"datePublished\":\"2025-03-17T11:03:45+00:00\",\"dateModified\":\"2025-03-25T12:24:10+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/aborrego.inscastellbisbal.net\\\/2025\\\/03\\\/17\\\/recollir-informacio-via-web-scraping\\\/\"},\"wordCount\":165,\"publisher\":{\"@id\":\"https:\\\/\\\/aborrego.inscastellbisbal.net\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/aborrego.inscastellbisbal.net\\\/2025\\\/03\\\/17\\\/recollir-informacio-via-web-scraping\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/aborrego.inscastellbisbal.net\\\/wp-content\\\/uploads\\\/2025\\\/03\\\/Selection_303-1024x150.png\",\"articleSection\":[\"Repte 1.6\",\"\ud83e\udd16- Disseny i aplicaci\u00f3 de la IA\"],\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/aborrego.inscastellbisbal.net\\\/2025\\\/03\\\/17\\\/recollir-informacio-via-web-scraping\\\/\",\"url\":\"https:\\\/\\\/aborrego.inscastellbisbal.net\\\/2025\\\/03\\\/17\\\/recollir-informacio-via-web-scraping\\\/\",\"name\":\"Recollir informaci\u00f3 via web scraping - Ana Borrego Toledo\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/aborrego.inscastellbisbal.net\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/aborrego.inscastellbisbal.net\\\/2025\\\/03\\\/17\\\/recollir-informacio-via-web-scraping\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/aborrego.inscastellbisbal.net\\\/2025\\\/03\\\/17\\\/recollir-informacio-via-web-scraping\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/aborrego.inscastellbisbal.net\\\/wp-content\\\/uploads\\\/2025\\\/03\\\/Selection_303-1024x150.png\",\"datePublished\":\"2025-03-17T11:03:45+00:00\",\"dateModified\":\"2025-03-25T12:24:10+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/aborrego.inscastellbisbal.net\\\/2025\\\/03\\\/17\\\/recollir-informacio-via-web-scraping\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/aborrego.inscastellbisbal.net\\\/2025\\\/03\\\/17\\\/recollir-informacio-via-web-scraping\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/aborrego.inscastellbisbal.net\\\/2025\\\/03\\\/17\\\/recollir-informacio-via-web-scraping\\\/#primaryimage\",\"url\":\"https:\\\/\\\/aborrego.inscastellbisbal.net\\\/wp-content\\\/uploads\\\/2025\\\/03\\\/Selection_303-1024x150.png\",\"contentUrl\":\"https:\\\/\\\/aborrego.inscastellbisbal.net\\\/wp-content\\\/uploads\\\/2025\\\/03\\\/Selection_303-1024x150.png\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/aborrego.inscastellbisbal.net\\\/2025\\\/03\\\/17\\\/recollir-informacio-via-web-scraping\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/aborrego.inscastellbisbal.net\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Recollir informaci\u00f3 via web scraping\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/aborrego.inscastellbisbal.net\\\/#website\",\"url\":\"https:\\\/\\\/aborrego.inscastellbisbal.net\\\/\",\"name\":\"Ana Borrego Toledo\",\"description\":\"\",\"publisher\":{\"@id\":\"https:\\\/\\\/aborrego.inscastellbisbal.net\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/aborrego.inscastellbisbal.net\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/aborrego.inscastellbisbal.net\\\/#organization\",\"name\":\"Ana Borrego Toledo\",\"url\":\"https:\\\/\\\/aborrego.inscastellbisbal.net\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/aborrego.inscastellbisbal.net\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/aborrego.inscastellbisbal.net\\\/wp-content\\\/uploads\\\/2025\\\/03\\\/cropped-Selection_255.png\",\"contentUrl\":\"https:\\\/\\\/aborrego.inscastellbisbal.net\\\/wp-content\\\/uploads\\\/2025\\\/03\\\/cropped-Selection_255.png\",\"width\":549,\"height\":328,\"caption\":\"Ana Borrego Toledo\"},\"image\":{\"@id\":\"https:\\\/\\\/aborrego.inscastellbisbal.net\\\/#\\\/schema\\\/logo\\\/image\\\/\"}},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/aborrego.inscastellbisbal.net\\\/#\\\/schema\\\/person\\\/596b5d3dc4735c43cfaafed47741b9d1\",\"name\":\"aborrego\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/e158785622f9364a28d322a7a8711e2ea34ebb67263b57ff915d7b49cb3cf0d1?s=96&d=mm&r=g\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/e158785622f9364a28d322a7a8711e2ea34ebb67263b57ff915d7b49cb3cf0d1?s=96&d=mm&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/e158785622f9364a28d322a7a8711e2ea34ebb67263b57ff915d7b49cb3cf0d1?s=96&d=mm&r=g\",\"caption\":\"aborrego\"},\"sameAs\":[\"https:\\\/\\\/aborrego.inscastellbisbal.net\",\"https:\\\/\\\/www.instagram.com\\\/anaaaa.aaaaaaaaaaaaaaaaaaaaa?igsh=MWVqcmx5MXl5eGdxaw==\"],\"url\":\"https:\\\/\\\/aborrego.inscastellbisbal.net\\\/en\\\/author\\\/aborrego\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Recollir informaci\u00f3 via web scraping - Ana Borrego Toledo","robots":{"index":"noindex","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"og_locale":"en_US","og_type":"article","og_title":"Recollir informaci\u00f3 via web scraping - Ana Borrego Toledo","og_description":"RECOLLIR INFORMACI\u00d3 VIA WEB SCRAPING Tornar al repte 1.6 Primer, vaig haver d\u2019acabar d\u2019afegir tota la informaci\u00f3 al codi fins que vaig aconseguir que les respostes del xatbot fossin del meu gust. Despr\u00e9s, vaig ajustar algunes altres configuracions. Com que vaig voler que el meu xatbot simul\u00e9s que era jo, vaig canviar el seu nom a Ana i vaig fer altres modificacions a les configuracions. Un cop fet aix\u00f2, vaig preguntar a ChatGPT com fer web scraping, adjuntant-li tot el codi que tenia fins aquell moment. Com que la meva p\u00e0gina \u00e9s HTML est\u00e0tic, vaig haver d\u2019instal\u00b7lar la llibreria BeautifulSoup per poder extreure informaci\u00f3. \u00a0Un cop instal\u00b7lada, vaig verificar que tot estigu\u00e9s correctament configurat. Despr\u00e9s, vaig afegir el codi que ChatGPT em va proporcionar, situant-lo abans del bucle del xat. Aquest codi permetia extreure el text de la p\u00e0gina, obtenir tots els enlla\u00e7os i veure els encap\u00e7alaments.","og_url":"https:\/\/aborrego.inscastellbisbal.net\/en\/2025\/03\/17\/recollir-informacio-via-web-scraping\/","og_site_name":"Ana Borrego Toledo","article_published_time":"2025-03-17T11:03:45+00:00","article_modified_time":"2025-03-25T12:24:10+00:00","og_image":[{"width":1745,"height":256,"url":"https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_303.png","type":"image\/png"}],"author":"aborrego","twitter_card":"summary_large_image","twitter_misc":{"Written by":"aborrego","Est. reading time":"3 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/aborrego.inscastellbisbal.net\/2025\/03\/17\/recollir-informacio-via-web-scraping\/#article","isPartOf":{"@id":"https:\/\/aborrego.inscastellbisbal.net\/2025\/03\/17\/recollir-informacio-via-web-scraping\/"},"author":{"name":"aborrego","@id":"https:\/\/aborrego.inscastellbisbal.net\/#\/schema\/person\/596b5d3dc4735c43cfaafed47741b9d1"},"headline":"Recollir informaci\u00f3 via web scraping","datePublished":"2025-03-17T11:03:45+00:00","dateModified":"2025-03-25T12:24:10+00:00","mainEntityOfPage":{"@id":"https:\/\/aborrego.inscastellbisbal.net\/2025\/03\/17\/recollir-informacio-via-web-scraping\/"},"wordCount":165,"publisher":{"@id":"https:\/\/aborrego.inscastellbisbal.net\/#organization"},"image":{"@id":"https:\/\/aborrego.inscastellbisbal.net\/2025\/03\/17\/recollir-informacio-via-web-scraping\/#primaryimage"},"thumbnailUrl":"https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_303-1024x150.png","articleSection":["Repte 1.6","\ud83e\udd16- Disseny i aplicaci\u00f3 de la IA"],"inLanguage":"en-US"},{"@type":"WebPage","@id":"https:\/\/aborrego.inscastellbisbal.net\/2025\/03\/17\/recollir-informacio-via-web-scraping\/","url":"https:\/\/aborrego.inscastellbisbal.net\/2025\/03\/17\/recollir-informacio-via-web-scraping\/","name":"Recollir informaci\u00f3 via web scraping - Ana Borrego Toledo","isPartOf":{"@id":"https:\/\/aborrego.inscastellbisbal.net\/#website"},"primaryImageOfPage":{"@id":"https:\/\/aborrego.inscastellbisbal.net\/2025\/03\/17\/recollir-informacio-via-web-scraping\/#primaryimage"},"image":{"@id":"https:\/\/aborrego.inscastellbisbal.net\/2025\/03\/17\/recollir-informacio-via-web-scraping\/#primaryimage"},"thumbnailUrl":"https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_303-1024x150.png","datePublished":"2025-03-17T11:03:45+00:00","dateModified":"2025-03-25T12:24:10+00:00","breadcrumb":{"@id":"https:\/\/aborrego.inscastellbisbal.net\/2025\/03\/17\/recollir-informacio-via-web-scraping\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/aborrego.inscastellbisbal.net\/2025\/03\/17\/recollir-informacio-via-web-scraping\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/aborrego.inscastellbisbal.net\/2025\/03\/17\/recollir-informacio-via-web-scraping\/#primaryimage","url":"https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_303-1024x150.png","contentUrl":"https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_303-1024x150.png"},{"@type":"BreadcrumbList","@id":"https:\/\/aborrego.inscastellbisbal.net\/2025\/03\/17\/recollir-informacio-via-web-scraping\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/aborrego.inscastellbisbal.net\/"},{"@type":"ListItem","position":2,"name":"Recollir informaci\u00f3 via web scraping"}]},{"@type":"WebSite","@id":"https:\/\/aborrego.inscastellbisbal.net\/#website","url":"https:\/\/aborrego.inscastellbisbal.net\/","name":"Ana Borrego Toledo","description":"","publisher":{"@id":"https:\/\/aborrego.inscastellbisbal.net\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/aborrego.inscastellbisbal.net\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/aborrego.inscastellbisbal.net\/#organization","name":"Ana Borrego Toledo","url":"https:\/\/aborrego.inscastellbisbal.net\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/aborrego.inscastellbisbal.net\/#\/schema\/logo\/image\/","url":"https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/cropped-Selection_255.png","contentUrl":"https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/cropped-Selection_255.png","width":549,"height":328,"caption":"Ana Borrego Toledo"},"image":{"@id":"https:\/\/aborrego.inscastellbisbal.net\/#\/schema\/logo\/image\/"}},{"@type":"Person","@id":"https:\/\/aborrego.inscastellbisbal.net\/#\/schema\/person\/596b5d3dc4735c43cfaafed47741b9d1","name":"aborrego","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/e158785622f9364a28d322a7a8711e2ea34ebb67263b57ff915d7b49cb3cf0d1?s=96&d=mm&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/e158785622f9364a28d322a7a8711e2ea34ebb67263b57ff915d7b49cb3cf0d1?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/e158785622f9364a28d322a7a8711e2ea34ebb67263b57ff915d7b49cb3cf0d1?s=96&d=mm&r=g","caption":"aborrego"},"sameAs":["https:\/\/aborrego.inscastellbisbal.net","https:\/\/www.instagram.com\/anaaaa.aaaaaaaaaaaaaaaaaaaaa?igsh=MWVqcmx5MXl5eGdxaw=="],"url":"https:\/\/aborrego.inscastellbisbal.net\/en\/author\/aborrego\/"}]}},"_links":{"self":[{"href":"https:\/\/aborrego.inscastellbisbal.net\/en\/wp-json\/wp\/v2\/posts\/5977","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/aborrego.inscastellbisbal.net\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/aborrego.inscastellbisbal.net\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/aborrego.inscastellbisbal.net\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/aborrego.inscastellbisbal.net\/en\/wp-json\/wp\/v2\/comments?post=5977"}],"version-history":[{"count":13,"href":"https:\/\/aborrego.inscastellbisbal.net\/en\/wp-json\/wp\/v2\/posts\/5977\/revisions"}],"predecessor-version":[{"id":6002,"href":"https:\/\/aborrego.inscastellbisbal.net\/en\/wp-json\/wp\/v2\/posts\/5977\/revisions\/6002"}],"wp:attachment":[{"href":"https:\/\/aborrego.inscastellbisbal.net\/en\/wp-json\/wp\/v2\/media?parent=5977"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/aborrego.inscastellbisbal.net\/en\/wp-json\/wp\/v2\/categories?post=5977"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/aborrego.inscastellbisbal.net\/en\/wp-json\/wp\/v2\/tags?post=5977"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}