Step 4: Get The Body Text From All 100 #1 Rankings
By now, you should have a list of the top 100 keywords within your niche.
Next, we need to get just the body text from the #1 ranking page for each keyword. Beware: this will take some time. But it's worth it. Your competitors aren't doing stuff like this.
- Install the Google Chrome Extension called Reader View.
- If you have the URL of the first keyword in your spreadsheet, then go to that URL. Otherwise, Google the keyword and go to the #1 ranking URL.
- Click on the Reader View plugin and, in the settings on the left side, hide images to avoid copying image alt text.
- Select all the body text and copy.
- Open up a plain text editor (avoid anything with rich formatting like Google Docs, Word, etc.) and paste in the text. This removes the HTML formatting.
- Mark this row as complete and repeat these steps for all remaining rows. What you'll end up with is a text document containing the body text of every #1 ranking page for your top 100 keywords.
- Save the entire text (called a corpus) as a .txt file.
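If you'd rather keep each page's text separate while you work and stitch everything together at the end, a few lines of Python can build the corpus for you. This is only a minimal sketch, not part of the original workflow: the function names `build_corpus` and `save_corpus` and the file name `corpus.txt` are made up for illustration, and it assumes you already have each page's copied body text as a plain string.

```python
from pathlib import Path


def build_corpus(page_texts):
    """Join the body text of each page into one corpus string."""
    # Strip stray whitespace and skip any page that came back empty.
    cleaned = [text.strip() for text in page_texts if text.strip()]
    # Separate pages with a blank line so the file stays readable.
    return "\n\n".join(cleaned) + "\n"


def save_corpus(page_texts, path="corpus.txt"):
    """Write the corpus to a UTF-8 .txt file and return its path.

    The default file name is a placeholder; use whatever you like.
    """
    out = Path(path)
    out.write_text(build_corpus(page_texts), encoding="utf-8")
    return out
```

Calling `save_corpus([text_1, text_2, ...])` with your copied page texts would give you the same kind of single .txt file described above.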
💡 Tip: Before saving the .txt file, you may get slightly better results from SketchEngine if you lowercase all the words in your corpus. You can use a tool like https://convertcase.net/
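If your corpus is too big to comfortably paste into a web tool, lowercasing can also be done locally. Here's a rough sketch in Python, assuming your corpus is already saved as a UTF-8 text file. The file names `corpus.txt` and `corpus_lower.txt` and both function names are placeholders, not anything the tools above require.

```python
from pathlib import Path


def lowercase_text(text):
    """Return the text with every letter lowercased."""
    return text.lower()


def lowercase_file(src="corpus.txt", dst="corpus_lower.txt"):
    """Read a corpus file, lowercase it, and save it under a new name.

    Both file names are placeholders. Writing to a new file keeps the
    original corpus untouched in case you want it later.
    """
    text = Path(src).read_text(encoding="utf-8")
    out = Path(dst)
    out.write_text(lowercase_text(text), encoding="utf-8")
    return out
```

Running `lowercase_file()` in the folder that holds your corpus would leave you with a lowercased copy alongside the original.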