Elixir/HTML dump scraper: Difference between revisions
use Project template |
some examples |
||
| Line 5: | Line 5: | ||
{{Project|status=(in progress)|url=https://gitlab.com/wmde/technical-wishes/scrape-wiki-html-dump}} | {{Project|status=(in progress)|url=https://gitlab.com/wmde/technical-wishes/scrape-wiki-html-dump}} | ||
=== What are references? === | |||
References are the little footnotes all over Wikipedia articles.<ref>Like this one.</ref> Citations are used to ground the writing in sources, which are especially important on Wikipedia because of the rule against so-called "original research". Everything needs to be paraphrased from existing secondary sources. | |||
A raw reference looks like <code><nowiki><ref>This footnote.</ref></nowiki></code>. But most references are fancier, and rely on reusable structures called templates. They get long, but let's take a simple example <code><nowiki>{{sfn|Burgess|2011|p=290}}</nowiki></code> | |||
=== Challenges of wikitext === | |||