added micro_blog #3268

Draft

meatball133 wants to merge 4 commits into exercism:main from meatball133:microblog

Member

meatball133 commented Dec 5, 2022

No description provided.

meatball133 and others added 4 commits

December 5, 2022 09:40


          added micro_blog

c131885


          Merge branch 'exercism:main' into microblog

d6fd1ba


          Updated jinja

be58f08

fix

2176d0d

BethanyG added paused and removed paused labels

Contributor

vaeng commented Feb 1, 2023

Are you still working on this, @meatball133? I would be happy to help if you'd like.

Member

BethanyG commented Feb 1, 2023

Hi @vaeng 👋🏽

Thank you for your interest. 😄

The open PRs here are drafts of work that have been pre-agreed. @meatball133 and I are still working through them. Overall, wider community contributions have been paused for this track until at least May/June. But if you have issues or proposals, we will be happy to discuss them in the exercism forum.

BethanyG reviewed

exercises/practice/micro-blog/.docs/instructions.md

+              - **ASCII** can encode English language characters.
+                All characters are precisely 1 byte long.
+              - **UTF-8** is a Unicode text encoding.

Member

BethanyG Jun 15, 2023

Suggested change

- - **UTF-8** is a Unicode text encoding.
+ - **UTF-8** is a variable-length Unicode text encoding.

BethanyG reviewed

exercises/practice/micro-blog/.docs/instructions.md

+                All characters are precisely 1 byte long.
+              - **UTF-8** is a Unicode text encoding.
+                Characters take between 1 and 4 bytes.
+              - **UTF-16** is a Unicode text encoding.

Member

BethanyG Jun 15, 2023

Suggested change

- - **UTF-16** is a Unicode text encoding.
+ - **UTF-16** is also a variable-length Unicode text encoding.

BethanyG reviewed

exercises/practice/micro-blog/.docs/instructions.md

+              - **UTF-16** is a Unicode text encoding.
+                Characters are either 2 or 4 bytes long.
+              UTF-8 and UTF-16 are both Unicode encodings which means they're capable of representing a massive range of characters including:

Member

BethanyG Jun 15, 2023 (edited)

Suggested change

- UTF-8 and UTF-16 are both Unicode encodings which means they're capable of representing a massive range of characters including:
+ UTF-8 and UTF-16 are both capable of representing a massive range of reader-perceived 'characters' or [graphemes][grapheme] including:

BethanyG reviewed

exercises/practice/micro-blog/.docs/instructions.md

+              Consider the letter 'a' and the emoji '😛'.
+              In UTF-16 the letter takes 2 bytes but the emoji takes 4 bytes.
+              The trick to this exercise is to use APIs designed around Unicode characters (codepoints) instead of Unicode codeunits.

Member

BethanyG Jun 15, 2023

Suggested change

- The trick to this exercise is to use APIs designed around Unicode characters (codepoints) instead of Unicode codeunits.
+ The trick to this exercise is to use APIs designed around Unicode characters (codepoints) instead of Unicode codeunits.
+
+ [grapheme]: https://dictionary.cambridge.org/us/dictionary/english/grapheme
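
For reference, a minimal Python sketch of the code point versus code unit distinction that this line of the instructions relies on. The helper name `utf16_code_units` is illustrative and not part of the PR; the sketch assumes Python 3, where `len()` on a `str` counts code points.

```python
def utf16_code_units(text: str) -> int:
    """Count the UTF-16 code units (2 bytes each) needed to encode text."""
    # "utf-16-le" is used so that no byte-order mark is prepended,
    # which would otherwise add 2 bytes to every result.
    return len(text.encode("utf-16-le")) // 2


letter = "a"
emoji = "😛"  # a single code point outside the Basic Multilingual Plane

# Code points (what Python's str API works with) vs. UTF-16 code units.
print(len(letter), utf16_code_units(letter))  # 1 1
print(len(emoji), utf16_code_units(emoji))    # 1 2
```

An API that indexes by UTF-16 code unit would see the emoji as two items and could cut it in half; an API that indexes by code point sees one.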

BethanyG reviewed

exercises/practice/micro-blog/.docs/instructions.md

+              - Text in most of the world's languages and scripts
+              - Historic text
+              - Emoji

Member

BethanyG Jun 15, 2023

Suggested change

- - Emoji
+ - Emoji
+
+ - Symbols used in Physics and Mathematics

BethanyG reviewed

exercises/practice/micro-blog/.docs/instructions.md

+              - Historic text
+              - Emoji
+              UTF-8 and UTF-16 are both variable length encodings, which means that different characters take up different amounts of space.

Member

BethanyG Jun 15, 2023

Suggested change

- UTF-8 and UTF-16 are both variable length encodings, which means that different characters take up different amounts of space.
+ UTF-8 and UTF-16 are both variable length encodings, which means that different graphemes can take up different amounts of space.
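
To make "different amounts of space" concrete, here is a small Python sketch that reports encoded byte lengths for a few sample characters. The function name `encoded_lengths` and the choice of samples are assumptions made for illustration only.

```python
def encoded_lengths(char: str) -> tuple:
    """Return (UTF-8 byte count, UTF-16 byte count) for the given text."""
    # "utf-16-le" avoids counting the 2-byte byte-order mark.
    return len(char.encode("utf-8")), len(char.encode("utf-16-le"))


# Samples written with escapes so each is unambiguously one code point.
samples = {
    "a": "a",                  # U+0061
    "e-acute": "\u00e9",       # U+00E9
    "euro sign": "\u20ac",     # U+20AC
    "emoji": "\U0001F61B",     # U+1F61B
}

for name, char in samples.items():
    utf8_len, utf16_len = encoded_lengths(char)
    print(f"{name}: UTF-8 = {utf8_len} bytes, UTF-16 = {utf16_len} bytes")
```

The UTF-8 lengths here range from 1 to 4 bytes, while the UTF-16 lengths are either 2 or 4 bytes, in line with the ranges given in the instructions.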

BethanyG reviewed

exercises/practice/micro-blog/.meta/config.json

		@@ -0,0 +1,19 @@
		{
		"blurb": "Given an input string, truncate it to 5 characters.",

Member

BethanyG Jun 15, 2023

Suggested change

-   "blurb": "Given an input string, truncate it to 5 characters.",
+   "blurb": "Given a Unicode input string, truncate it to 5 grapheme clusters.",
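
As a rough illustration of what the blurb describes, here is a sketch in Python. It is not the exemplar from this PR: the name `truncate` and the `limit` parameter are assumptions, and it truncates by code point, which matches grapheme clusters only when each grapheme is a single code point.

```python
def truncate(phrase: str, limit: int = 5) -> str:
    """Return the first `limit` Unicode code points of `phrase`."""
    # A Python 3 str is a sequence of code points, so slicing never splits
    # a code point, whatever its UTF-8 or UTF-16 byte length.
    return phrase[:limit]


print(truncate("Hello, world"))  # Hello
print(truncate("😛😛😛😛😛😛😛"))

# Caveat: a grapheme built from several code points can still be split.
decomposed = "e\u0301" * 5  # five accented letters, each 'e' + U+0301
print(len(truncate(decomposed)))  # 5 code points, fewer than 5 graphemes
```

Plain `str` slicing does not perform grapheme-cluster segmentation, so if the exercise really targets grapheme clusters rather than code points, the test data would need to stay within single-code-point graphemes or the solution would need a segmentation step.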
