Skip to content

Commit 7c1bb4e

Browse files
committed
[gleam] complete word-count
1 parent 55d8f41 commit 7c1bb4e

File tree

7 files changed

+299
-0
lines changed

7 files changed

+299
-0
lines changed

gleam/word-count/.gitignore

+4
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,4 @@
1+
*.beam
2+
*.ez
3+
build
4+
erl_crash.dump

gleam/word-count/HELP.md

+32
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,32 @@
1+
# Help
2+
3+
## Running the tests
4+
5+
To run the tests, run the command `gleam test` from within the exercise directory.
6+
7+
## Submitting your solution
8+
9+
You can submit your solution using the `exercism submit src/word_count.gleam` command.
10+
This command will upload your solution to the Exercism website and print the solution page's URL.
11+
12+
It's possible to submit an incomplete solution which allows you to:
13+
14+
- See how others have completed the exercise
15+
- Request help from a mentor
16+
17+
## Need to get help?
18+
19+
If you'd like help solving the exercise, check the following pages:
20+
21+
- The [Gleam track's documentation](https://exercism.org/docs/tracks/gleam)
22+
- The [Gleam track's programming category on the forum](https://forum.exercism.org/c/programming/gleam)
23+
- [Exercism's programming category on the forum](https://forum.exercism.org/c/programming/5)
24+
- The [Frequently Asked Questions](https://exercism.org/docs/using/faqs)
25+
26+
Should those resources not suffice, you could submit your (incomplete) solution to request mentoring.
27+
28+
To get help if you're having trouble, you can use one of the following resources:
29+
30+
- [gleam.run](https://gleam.run/documentation/) is the gleam official documentation.
31+
- [Discord](https://discord.gg/Fm8Pwmy) is the discord channel.
32+
- [StackOverflow](https://stackoverflow.com/questions/tagged/gleam) can be used to search for your problem and see if it has been answered already. You can also ask and answer questions.

gleam/word-count/README.md

+71
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,71 @@
1+
# Word Count
2+
3+
Welcome to Word Count on Exercism's Gleam Track.
4+
If you need help running the tests or submitting your code, check out `HELP.md`.
5+
6+
## Introduction
7+
8+
You teach English as a foreign language to high school students.
9+
10+
You've decided to base your entire curriculum on TV shows.
11+
You need to analyze which words are used, and how often they're repeated.
12+
13+
This will let you choose the simplest shows to start with, and to gradually increase the difficulty as time passes.
14+
15+
## Instructions
16+
17+
Your task is to count how many times each word occurs in a subtitle of a drama.
18+
19+
The subtitles from these dramas use only ASCII characters.
20+
21+
The characters often speak in casual English, using contractions like _they're_ or _it's_.
22+
Though these contractions come from two words (e.g. _we are_), the contraction (_we're_) is considered a single word.
23+
24+
Words can be separated by any form of punctuation (e.g. ":", "!", or "?") or whitespace (e.g. "\t", "\n", or " ").
25+
The only punctuation that does not separate words is the apostrophe in contractions.
26+
27+
Numbers are considered words.
28+
If the subtitles say _It costs 100 dollars._ then _100_ will be its own word.
29+
30+
Words are case insensitive.
31+
For example, the word _you_ occurs three times in the following sentence:
32+
33+
> You come back, you hear me? DO YOU HEAR ME?
34+
35+
The ordering of the word counts in the results doesn't matter.
36+
37+
Here's an example that incorporates several of the elements discussed above:
38+
39+
- simple words
40+
- contractions
41+
- numbers
42+
- case insensitive words
43+
- punctuation (including apostrophes) to separate words
44+
- different forms of whitespace to separate words
45+
46+
`"That's the password: 'PASSWORD 123'!", cried the Special Agent.\nSo I fled.`
47+
48+
The mapping for this subtitle would be:
49+
50+
```text
51+
123: 1
52+
agent: 1
53+
cried: 1
54+
fled: 1
55+
i: 1
56+
password: 2
57+
so: 1
58+
special: 1
59+
that's: 1
60+
the: 2
61+
```
62+
63+
## Source
64+
65+
### Created by
66+
67+
- @lpil
68+
69+
### Based on
70+
71+
This is a classic toy problem, but we were reminded of it by seeing it in the Go Tour.

gleam/word-count/gleam.toml

+12
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,12 @@
1+
name = "word_count"
2+
version = "0.1.0"
3+
4+
[dependencies]
5+
gleam_bitwise = "~> 1.2"
6+
gleam_otp = "~> 0.7 or ~> 1.0"
7+
gleam_stdlib = "~> 0.32 or ~> 1.0"
8+
simplifile = "~> 1.0"
9+
gleam_erlang = ">= 0.25.0 and < 1.0.0"
10+
11+
[dev-dependencies]
12+
exercism_test_runner = "~> 1.4"

gleam/word-count/manifest.toml

+27
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,27 @@
1+
# This file was generated by Gleam
2+
# You typically do not need to edit this file
3+
4+
packages = [
5+
{ name = "argv", version = "1.0.1", build_tools = ["gleam"], requirements = [], otp_app = "argv", source = "hex", outer_checksum = "A6E9009E50BBE863EB37D963E4315398D41A3D87D0075480FC244125808F964A" },
6+
{ name = "exercism_test_runner", version = "1.7.0", build_tools = ["gleam"], requirements = ["argv", "gap", "glance", "gleam_community_ansi", "gleam_erlang", "gleam_json", "gleam_stdlib", "simplifile"], otp_app = "exercism_test_runner", source = "hex", outer_checksum = "2FC1BADB19BEC2AE77BFD2D3A606A014C85412A7B874CAFC4BA8CF04B0B257CD" },
7+
{ name = "gap", version = "1.1.0", build_tools = ["gleam"], requirements = ["gleam_community_ansi", "gleam_stdlib"], otp_app = "gap", source = "hex", outer_checksum = "2EE1B0A17E85CF73A0C1D29DA315A2699117A8F549C8E8D89FA8261BE41EDEB1" },
8+
{ name = "glance", version = "0.8.2", build_tools = ["gleam"], requirements = ["gleam_stdlib", "glexer"], otp_app = "glance", source = "hex", outer_checksum = "ACF09457E8B564AD7A0D823DAFDD326F58263C01ACB0D432A9BEFDEDD1DA8E73" },
9+
{ name = "gleam_bitwise", version = "1.3.1", build_tools = ["gleam"], requirements = [], otp_app = "gleam_bitwise", source = "hex", outer_checksum = "B36E1D3188D7F594C7FD4F43D0D2CE17561DE896202017548578B16FE1FE9EFC" },
10+
{ name = "gleam_community_ansi", version = "1.4.0", build_tools = ["gleam"], requirements = ["gleam_community_colour", "gleam_stdlib"], otp_app = "gleam_community_ansi", source = "hex", outer_checksum = "FE79E08BF97009729259B6357EC058315B6FBB916FAD1C2FF9355115FEB0D3A4" },
11+
{ name = "gleam_community_colour", version = "1.3.0", build_tools = ["gleam"], requirements = ["gleam_stdlib"], otp_app = "gleam_community_colour", source = "hex", outer_checksum = "A49A5E3AE8B637A5ACBA80ECB9B1AFE89FD3D5351FF6410A42B84F666D40D7D5" },
12+
{ name = "gleam_erlang", version = "0.25.0", build_tools = ["gleam"], requirements = ["gleam_stdlib"], otp_app = "gleam_erlang", source = "hex", outer_checksum = "054D571A7092D2A9727B3E5D183B7507DAB0DA41556EC9133606F09C15497373" },
13+
{ name = "gleam_json", version = "1.0.0", build_tools = ["gleam"], requirements = ["gleam_stdlib", "thoas"], otp_app = "gleam_json", source = "hex", outer_checksum = "8B197DD5D578EA6AC2C0D4BDC634C71A5BCA8E7DB5F47091C263ECB411A60DF3" },
14+
{ name = "gleam_otp", version = "0.10.0", build_tools = ["gleam"], requirements = ["gleam_erlang", "gleam_stdlib"], otp_app = "gleam_otp", source = "hex", outer_checksum = "0B04FE915ACECE539B317F9652CAADBBC0F000184D586AAAF2D94C100945D72B" },
15+
{ name = "gleam_stdlib", version = "0.36.0", build_tools = ["gleam"], requirements = [], otp_app = "gleam_stdlib", source = "hex", outer_checksum = "C0D14D807FEC6F8A08A7C9EF8DFDE6AE5C10E40E21325B2B29365965D82EB3D4" },
16+
{ name = "glexer", version = "0.7.0", build_tools = ["gleam"], requirements = ["gleam_stdlib"], otp_app = "glexer", source = "hex", outer_checksum = "4484942A465482A0A100936E1E5F12314DB4B5AC0D87575A7B9E9062090B96BE" },
17+
{ name = "simplifile", version = "1.5.0", build_tools = ["gleam"], requirements = ["gleam_stdlib"], otp_app = "simplifile", source = "hex", outer_checksum = "EB9AA8E65E5C1E3E0FDCFC81BC363FD433CB122D7D062750FFDF24DE4AC40116" },
18+
{ name = "thoas", version = "0.4.1", build_tools = ["rebar3"], requirements = [], otp_app = "thoas", source = "hex", outer_checksum = "4918D50026C073C4AB1388437132C77A6F6F7C8AC43C60C13758CC0ADCE2134E" },
19+
]
20+
21+
[requirements]
22+
exercism_test_runner = { version = "~> 1.4" }
23+
gleam_bitwise = { version = "~> 1.2" }
24+
gleam_erlang = { version = ">= 0.25.0 and < 1.0.0"}
25+
gleam_otp = { version = "~> 0.7 or ~> 1.0" }
26+
gleam_stdlib = { version = "~> 0.32 or ~> 1.0" }
27+
simplifile = { version = "~> 1.0" }

gleam/word-count/src/word_count.gleam

+14
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,14 @@
1+
import gleam/dict.{type Dict}
2+
import gleam/list
3+
import gleam/regex
4+
import gleam/string
5+
6+
pub fn count_words(input: String) -> Dict(String, Int) {
7+
let assert Ok(re) = regex.from_string("\\w+('\\w+)?")
8+
9+
input
10+
|> string.lowercase
11+
|> regex.scan(re, _)
12+
|> list.group(fn(match) { match.content })
13+
|> dict.map_values(fn(_, matches) { list.length(matches) })
14+
}
+139
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,139 @@
1+
import exercism/should
2+
import exercism/test_runner
3+
import gleam/dict
4+
import word_count
5+
6+
pub fn main() {
7+
test_runner.main()
8+
}
9+
10+
pub fn count_one_word_test() {
11+
"word"
12+
|> word_count.count_words
13+
|> should.equal(dict.from_list([#("word", 1)]))
14+
}
15+
16+
pub fn count_one_of_each_word_test() {
17+
"one of each"
18+
|> word_count.count_words
19+
|> should.equal(dict.from_list([#("one", 1), #("of", 1), #("each", 1)]))
20+
}
21+
22+
pub fn multiple_occurrences_of_a_word_test() {
23+
"one fish two fish red fish blue fish"
24+
|> word_count.count_words
25+
|> should.equal(
26+
dict.from_list([
27+
#("one", 1),
28+
#("fish", 4),
29+
#("two", 1),
30+
#("red", 1),
31+
#("blue", 1),
32+
]),
33+
)
34+
}
35+
36+
pub fn handles_cramped_lists_test() {
37+
"one,two,three"
38+
|> word_count.count_words
39+
|> should.equal(dict.from_list([#("one", 1), #("two", 1), #("three", 1)]))
40+
}
41+
42+
pub fn handles_expanded_lists_test() {
43+
"one,\ntwo,\nthree"
44+
|> word_count.count_words
45+
|> should.equal(dict.from_list([#("one", 1), #("two", 1), #("three", 1)]))
46+
}
47+
48+
pub fn ignore_punctuation_test() {
49+
"car: carpet as java: javascript!!&@$%^&"
50+
|> word_count.count_words
51+
|> should.equal(
52+
dict.from_list([
53+
#("car", 1),
54+
#("carpet", 1),
55+
#("as", 1),
56+
#("java", 1),
57+
#("javascript", 1),
58+
]),
59+
)
60+
}
61+
62+
pub fn include_numbers_test() {
63+
"testing, 1, 2 testing"
64+
|> word_count.count_words
65+
|> should.equal(dict.from_list([#("testing", 2), #("1", 1), #("2", 1)]))
66+
}
67+
68+
pub fn normalize_case_test() {
69+
"go Go GO Stop stop"
70+
|> word_count.count_words
71+
|> should.equal(dict.from_list([#("go", 3), #("stop", 2)]))
72+
}
73+
74+
pub fn with_apostrophes_test() {
75+
"'First: don't laugh. Then: don't cry. You're getting it.'"
76+
|> word_count.count_words
77+
|> should.equal(
78+
dict.from_list([
79+
#("first", 1),
80+
#("don't", 2),
81+
#("laugh", 1),
82+
#("then", 1),
83+
#("cry", 1),
84+
#("you're", 1),
85+
#("getting", 1),
86+
#("it", 1),
87+
]),
88+
)
89+
}
90+
91+
pub fn with_quotations_test() {
92+
"Joe can't tell between 'large' and large."
93+
|> word_count.count_words
94+
|> should.equal(
95+
dict.from_list([
96+
#("joe", 1),
97+
#("can't", 1),
98+
#("tell", 1),
99+
#("between", 1),
100+
#("large", 2),
101+
#("and", 1),
102+
]),
103+
)
104+
}
105+
106+
pub fn substrings_from_the_beginning_test() {
107+
"Joe can't tell between app, apple and a."
108+
|> word_count.count_words
109+
|> should.equal(
110+
dict.from_list([
111+
#("joe", 1),
112+
#("can't", 1),
113+
#("tell", 1),
114+
#("between", 1),
115+
#("app", 1),
116+
#("apple", 1),
117+
#("and", 1),
118+
#("a", 1),
119+
]),
120+
)
121+
}
122+
123+
pub fn multiple_spaces_not_detected_as_a_word_test() {
124+
" multiple whitespaces"
125+
|> word_count.count_words
126+
|> should.equal(dict.from_list([#("multiple", 1), #("whitespaces", 1)]))
127+
}
128+
129+
pub fn alternating_word_separators_not_detected_as_a_word_test() {
130+
",\n,one,\n ,two \n 'three'"
131+
|> word_count.count_words
132+
|> should.equal(dict.from_list([#("one", 1), #("two", 1), #("three", 1)]))
133+
}
134+
135+
pub fn quotation_for_word_with_apostrophe_test() {
136+
"can, can't, 'can't'"
137+
|> word_count.count_words
138+
|> should.equal(dict.from_list([#("can", 1), #("can't", 2)]))
139+
}

0 commit comments

Comments
 (0)