{"version":"1.0","provider_name":"Ana Borrego Toledo","provider_url":"https:\/\/aborrego.inscastellbisbal.net\/en","author_name":"aborrego","author_url":"https:\/\/aborrego.inscastellbisbal.net\/en\/author\/aborrego\/","title":"Recollir informaci\u00f3 via web scraping - Ana Borrego Toledo","type":"rich","width":600,"height":338,"html":"<blockquote class=\"wp-embedded-content\" data-secret=\"MiwhZTdIPU\"><a href=\"https:\/\/aborrego.inscastellbisbal.net\/en\/2025\/03\/17\/recollir-informacio-via-web-scraping\/\">Recollir informaci\u00f3 via web scraping<\/a><\/blockquote><iframe sandbox=\"allow-scripts\" security=\"restricted\" src=\"https:\/\/aborrego.inscastellbisbal.net\/en\/2025\/03\/17\/recollir-informacio-via-web-scraping\/embed\/#?secret=MiwhZTdIPU\" width=\"600\" height=\"338\" title=\"&#8220;Recollir informaci\u00f3 via web scraping&#8221; &#8212; Ana Borrego Toledo\" data-secret=\"MiwhZTdIPU\" frameborder=\"0\" marginwidth=\"0\" marginheight=\"0\" scrolling=\"no\" class=\"wp-embedded-content\"><\/iframe><script>\n\/*! This file is auto-generated *\/\n!function(d,l){\"use strict\";l.querySelector&&d.addEventListener&&\"undefined\"!=typeof URL&&(d.wp=d.wp||{},d.wp.receiveEmbedMessage||(d.wp.receiveEmbedMessage=function(e){var t=e.data;if((t||t.secret||t.message||t.value)&&!\/[^a-zA-Z0-9]\/.test(t.secret)){for(var s,r,n,a=l.querySelectorAll('iframe[data-secret=\"'+t.secret+'\"]'),o=l.querySelectorAll('blockquote[data-secret=\"'+t.secret+'\"]'),c=new RegExp(\"^https?:$\",\"i\"),i=0;i<o.length;i++)o[i].style.display=\"none\";for(i=0;i<a.length;i++)s=a[i],e.source===s.contentWindow&&(s.removeAttribute(\"style\"),\"height\"===t.message?(1e3<(r=parseInt(t.value,10))?r=1e3:~~r<200&&(r=200),s.height=r):\"link\"===t.message&&(r=new URL(s.getAttribute(\"src\")),n=new URL(t.value),c.test(n.protocol))&&n.host===r.host&&l.activeElement===s&&(d.top.location.href=t.value))}},d.addEventListener(\"message\",d.wp.receiveEmbedMessage,!1),l.addEventListener(\"DOMContentLoaded\",function(){for(var e,t,s=l.querySelectorAll(\"iframe.wp-embedded-content\"),r=0;r<s.length;r++)(t=(e=s[r]).getAttribute(\"data-secret\"))||(t=Math.random().toString(36).substring(2,12),e.src+=\"#?secret=\"+t,e.setAttribute(\"data-secret\",t)),e.contentWindow.postMessage({message:\"ready\",secret:t},\"*\")},!1)))}(window,document);\n\/\/# sourceURL=https:\/\/aborrego.inscastellbisbal.net\/wp-includes\/js\/wp-embed.min.js\n<\/script>","description":"RECOLLIR INFORMACI\u00d3 VIA WEB SCRAPING Tornar al repte 1.6 Primer, vaig haver d\u2019acabar d\u2019afegir tota la informaci\u00f3 al codi fins que vaig aconseguir que les respostes del xatbot fossin del meu gust. Despr\u00e9s, vaig ajustar algunes altres configuracions. Com que vaig voler que el meu xatbot simul\u00e9s que era jo, vaig canviar el seu nom a Ana i vaig fer altres modificacions a les configuracions. Un cop fet aix\u00f2, vaig preguntar a ChatGPT com fer web scraping, adjuntant-li tot el codi que tenia fins aquell moment. Com que la meva p\u00e0gina \u00e9s HTML est\u00e0tic, vaig haver d\u2019instal\u00b7lar la llibreria BeautifulSoup per poder extreure informaci\u00f3. \u00a0Un cop instal\u00b7lada, vaig verificar que tot estigu\u00e9s correctament configurat. Despr\u00e9s, vaig afegir el codi que ChatGPT em va proporcionar, situant-lo abans del bucle del xat. Aquest codi permetia extreure el text de la p\u00e0gina, obtenir tots els enlla\u00e7os i veure els encap\u00e7alaments.","thumbnail_url":"https:\/\/aborrego.inscastellbisbal.net\/wp-content\/uploads\/2025\/03\/Selection_303.png","thumbnail_width":1745,"thumbnail_height":256}