Skip to main content

What to do with 6 MIL + pages

I’m in-house SEO working for a really old company that’s been around for decades and has lots of different facets to it.

There’s a real legacy issue with the website. Hundreds of people have had access and autonomy over the site, and there’s so much crap. They’ve also used the site as an intranet and - though they’re pretty hard to find - there’s notes from meetings, HR docs and so much more live on the site.

I’m running a crawl now, after I noticed I’ve got www pages linking to non www pages. So I need to get everything on the same domain. I can’t do that until I know the extent of the issue, historically I’ve not been able to crawl the full site just because of time restraints.

So I’ve always crawled specific subfolders and tackled deadweight in stages. Now I want to bite the bullet and do a full crawl because I want the full picture. But it’s onto 6 million pages now (and counting) excl. images obviously.

When this is done, how do I even go about exporting this? Surely excel and google sheets can’t handle that much data? Any advice around this would be amazing.

Thank you!

PS using Screaming Frog

submitted by /u/Sick_Turtle
[link] [comments]

from Search Engine Optimization: The Latest SEO News https://ift.tt/3jCi7ce

Comments

Popular posts from this blog

Wordpress Tag Div Composer

Can somebody help me with scroll to class ( call to action ) button for long article, I wanna drive readers to specific place on the page… I keep adding the name and save then click on it but it’s not working? I cleared cache and still not working… Help please! submitted by /u/NadaGamalEldean [link] [comments] from Search Engine Optimization: The Latest SEO News https://ift.tt/3Bi5uf2

HTTP-HTTPS redirects: domain level or each URL?

Googling for answers seems to give conflicting answers here. We have a domain that has been HTTP but some individual URLs have been HTTPS (like form pages). We are about to deploy HTTPS redirect on the server to get everything going to https://www.domain.com . Do we have to go to the effort of a big 301 redirect list of all page-level URLs without Google dropping our earned page rank? Thanks all! submitted by /u/runtmc2 [link] [comments] from Search Engine Optimization: The Latest SEO News https://ift.tt/31dJ82b

One website, two properties (domains)

I wasn't sure how to frame that title so let me explain; Due to error on my part, I have ended up with, a www.xyz.com , version of my website, and one that's just xyz.com. Now I'm tracking my traffic differently on GSC and GA. I'm not sure how to feel about this. Is there anyone with a similar situation? Should I just go on like usual or are there things I need to start doing differently? Kindly advise. submitted by /u/Blessed_Dude_101010 [link] [comments] from Search Engine Optimization: The Latest SEO News https://ift.tt/87Z2MRA