Skip to content

Accessing full-text using extract_fulltext method

We are developing a fulltext table for all articles across our available newspapers. Meanwhile, @thobson88 has developed an .extract_fulltext() method that can be used on any Item objects. Here is an example:

from newspapers.models import Newspaper
from newspapers.models import Item
from pathlib import Path

# Set the local download path:
Item.DOWNLOAD_DIR = Path.home() / "temp/fulltext"

# Set the SAS token:
%env FULLTEXT_SAS_TOKEN="?an=SSH&token=true"

item = Newspaper.objects.get(publication_code="0003040").issues.first().items.first()
item.extract_fulltext()

If you need help setting up a SAS token, see instructions here.

Please note, access via Blobfuse is planned but not yet implemented.