This library allows you to pass multiple regular expressions and a string and get values back.
Let's say you have a text that is:
Del 5 av 6. Shakespeare är mycket nöjd med sin senaste pjäs, Så tuktas en argbigga. Men av någon anledning uppskattas inte berättelsen om hur en stark kvinna förnedras av en man av kvinnorna i Shakespeares närhet. Originaltitel: Upstart Crow. Produktion: BBC 2017. First we would split the text into an array based on \n and . so that we can loop over the long text, as our matches only returns the first match back.
Then you would do:
StringMatcher.new() |> StringMatcher.add_regexp( ~r/Del\s+(?<episode_num>[0-9]+?)\s+av\s+(?<of_episodes>[0-9]+?)/i, %{} ) |> StringMatcher.add_regexp(~r/Originaltitel: (?<original_title>.*)\./i, %{}) |> StringMatcher.add_regexp( ~r/Produktion: (?<production_company>.*?) (?<production_year>[0-9]+)\./i, %{} ) |> StringMatcher.match_captures(string)This should return a tuple with a map. The map is returned value of the regular expressions. If no match is found you will receive {:error, "no match"}
If available in Hex, the package can be installed by adding string_matcher to your list of dependencies in mix.exs:
def deps do [ {:string_matcher, "~> 0.1.0"} ] endDocumentation can be generated with ExDoc and published on HexDocs. Once published, the docs can be found at https://hexdocs.pm/string_matcher.
Currently the tests are failing for some reason, the library is working though and is stable. It's used in production.