Does JupyterLab language registry's addLanguage support LRLanguage as input?

jaysun_n · June 11, 2024, 10:52pm

I am trying to add a syntax highlighter. After searching for how JupyterLab declares its default languages, I came across code in the JupyterLab source that wraps an LRLanguage built with @lezer/generator. Is this a supported way to add new language support from an extension? I made a basic LRLanguage using the same format as in that file, but when I create a fenced code block it doesn’t highlight my code. I am getting no errors in the inspector.

Can I get some help determining what is wrong? My code is posted here: algorithm_plugin.ts, highlight.ts.

krassowski · June 12, 2024, 10:40am

There are two code highlighters: one for code and one for Markdown. What you did should get code in a file or code cell highlighted, but for Markdown it is a bit more complex. (I assume that by “fenced code block” you mean using it in Markdown; is this correct?)

jaysun_n · June 12, 2024, 11:30am

Yes, I am looking for markdown highlighting. I do not need file or code cell highlighting. My testing fenced code is simply:

``` algorithm
Hello
```
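For reference, a minimal sketch of how the fence above is usually resolved to a language: per CommonMark, the first word of the info string after the backticks is the language name, so the leading space in "``` algorithm" is harmless. The `LanguageEntry` shape and the case-insensitive name/alias lookup are simplifying assumptions for illustration, not JupyterLab's actual registry code.

```typescript
// Hypothetical, simplified stand-in for a language registry entry.
interface LanguageEntry {
  name: string;
  alias?: string[];
}

// Per CommonMark, the first word of a fence's info string names the language.
function fenceLanguage(openingFence: string): string {
  const info = openingFence.replace(/^\s*(`{3,}|~{3,})/, '').trim();
  return info.split(/\s+/)[0] ?? '';
}

// Assumed lookup rule: compare against name and aliases, ignoring case.
function findEntry(
  registered: LanguageEntry[],
  requested: string
): LanguageEntry | undefined {
  const wanted = requested.toLowerCase();
  return registered.find(entry =>
    [entry.name, ...(entry.alias ?? [])].some(
      candidate => candidate.toLowerCase() === wanted
    )
  );
}

const registeredLanguages: LanguageEntry[] = [
  { name: 'Algorithm', alias: ['algo'] },
  { name: 'Python', alias: ['py'] }
];

const requestedLanguage = fenceLanguage('``` algorithm');
const resolvedEntry = findEntry(registeredLanguages, requestedLanguage);
```

If `resolvedEntry` is undefined in a check like this, the identifier in the fence does not match any registered name or alias, and no highlighter will be picked.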

fcollonval · June 14, 2024, 3:27pm

Could you check that the code block is parsed by your language (you can look at the DOM node structure for that)?

Normally that should work. What I think will not work is the highlighting, because you need to match your tokens with tokens that the theme highlights:

https://github.com/jupyterlab/jupyterlab/blob/9c0c1e0c061b5dfebf5d0ad343e0464b77186bce/packages/codemirror/src/theme.ts#L93

              backgroundColor: 'var(--jp-layout-color1)'
            },
          
            '.cm-builtin': {
              color: 'var(--jp-mirror-editor-builtin-color)'
            }
          });
          
          // The list of available tags for syntax highlighting is available at
          // https://lezer.codemirror.net/docs/ref/#highlight.tags
          export const jupyterHighlightStyle = HighlightStyle.define([
            // Order matters - a rule will override the previous ones; important for example for in headings styles.
            { tag: t.meta, color: 'var(--jp-mirror-editor-meta-color)' },
            { tag: t.heading, color: 'var(--jp-mirror-editor-header-color)' },
            {
              tag: [t.heading1, t.heading2, t.heading3, t.heading4],
              color: 'var(--jp-mirror-editor-header-color)',
              fontWeight: 'bold'
            },
            {
              tag: t.keyword,

If I’m correct, to highlight your token (that is, your tag) you will need to provide a custom CodeMirror theme through this JupyterLab plugin:

https://github.com/jupyterlab/jupyterlab/blob/9c0c1e0c061b5dfebf5d0ad343e0464b77186bce/packages/codemirror-extension/src/services.tsx#L94

              return languages;
            }
          };
          
          /**
           * CodeMirror theme registry provider.
           */
          export const themePlugin: JupyterFrontEndPlugin<IEditorThemeRegistry> = {
            id: '@jupyterlab/codemirror-extension:themes',
            description: 'Provides the CodeMirror theme registry',
            provides: IEditorThemeRegistry,
            optional: [ITranslator],
            activate: (app: JupyterFrontEnd, translator: ITranslator | null) => {
              const themes = new EditorThemeRegistry();
              // Register default themes
              for (const theme of EditorThemeRegistry.getDefaultThemes(translator)) {
                themes.addTheme(theme);
              }
              return themes;
            }
          };

lilysam · June 15, 2024, 11:57am

Hello,
You then create a JupyterLab extension that registers your language with CodeMirror using specific MIME types and file extensions. Ensure your language identifier matches in fenced code blocks for proper integration, and monitor the browser console for any errors to debug effectively.
Thanks

jaysun_n · June 15, 2024, 12:14pm

Looking at the inspector, I can see the HTML, and it seems to be appropriately labelled as “algorithm”, which is the name I’ve been using.

So assuming I am parsing my fenced code correctly, I should just need to provide a syntax highlighter scheme to CodeMirror which should then get applied to my parsed code?

fcollonval · June 15, 2024, 12:42pm

Yes

Or, more easily, use token tags that are already highlighted. That means mapping your tokens to known tags; see the Lezer Setup Example.

A list of known tags is available there:
https://lezer.codemirror.net/docs/ref/#highlight.tags

Note: not all tags get highlighted in the default JupyterLab CodeMirror theme, hence the possible need to provide your own theme.
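A minimal sketch of the token-to-tag mapping described above, using `styleTags` from `@lezer/highlight`. The node names (`Identifier`, `Boolean`, and so on) are assumptions borrowed from the Lezer example grammar and must be replaced by the node names of your own grammar; the name `algorithmHighlighting` is also just illustrative.

```typescript
import { styleTags, tags as t } from '@lezer/highlight';

// Node names below are assumptions taken from the Lezer example grammar;
// replace them with the node names produced by your own grammar.
export const algorithmHighlighting = styleTags({
  Identifier: t.variableName,
  Boolean: t.bool,
  String: t.string,
  LineComment: t.lineComment,
  '( )': t.paren
});

// Typical wiring (not executed here because it needs the generated parser):
//   const language = LRLanguage.define({
//     parser: parser.configure({ props: [algorithmHighlighting] })
//   });
```

Whether a given tag is actually coloured still depends on the active theme, so a mapping like this only helps for tags that the theme styles.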

jaysun_n · June 17, 2024, 12:25pm

I’ve modified my extension to add a custom theme to the theme registry, but it still isn’t working. I updated my Lezer highlighter to use tags that appear in jupyterHighlightStyle so I can reuse it. I am seeing no errors or warnings on compilation or when I create fenced code in Markdown using my above template. When I create a .algorithm file for testing, I see that the Algorithm language is chosen, but I don’t see any highlighting.

Can you expand on what you mean about checking the DOM to ensure the language is getting properly recognized? I worry that while the language is recognized, the parser is failing somehow. To make testing easier I changed the grammar to be the Lezer example.

Here is my updated plugin and highlighter.
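One way to rule out a failing parser is to parse a sample string directly and look for error nodes in the resulting tree. The sketch below is written against small structural stand-ins (`TreeLike`, `NodeRefLike`) that are assumed to mirror Lezer's `Tree.iterate` and the `type.isError` flag, so it runs without the generated parser; with a real parser the input would be the result of `parser.parse(text)`.

```typescript
// Minimal structural stand-ins for the parts of Lezer's tree API used here
// (assumption: they mirror Tree.iterate and the node's type.isError flag).
interface NodeRefLike {
  name: string;
  from: number;
  to: number;
  type: { isError: boolean };
}

interface TreeLike {
  iterate(spec: { enter(node: NodeRefLike): boolean | void }): void;
}

interface ParseProblem {
  from: number;
  to: number;
}

// Collects the text ranges that the parser could not match to the grammar.
function findParseErrors(tree: TreeLike): ParseProblem[] {
  const problems: ParseProblem[] = [];
  tree.iterate({
    enter(node) {
      if (node.type.isError) {
        problems.push({ from: node.from, to: node.to });
      }
    }
  });
  return problems;
}

// Fake tree for demonstration only; a real one would come from the parser.
const fakeTree: TreeLike = {
  iterate(spec) {
    const nodes: NodeRefLike[] = [
      { name: 'Program', from: 0, to: 9, type: { isError: false } },
      { name: '⚠', from: 4, to: 5, type: { isError: true } }
    ];
    for (const node of nodes) {
      spec.enter(node);
    }
  }
};

const parseProblems = findParseErrors(fakeTree);
```

An empty result suggests the grammar accepts the input, in which case the missing colours are more likely a tag or theme issue than a parsing one.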

fcollonval · June 17, 2024, 3:09pm

I forgot about it. But actually we have a test for that in lab that should help you:

https://github.com/jupyterlab/jupyterlab/blob/5e4cae5d0b3b4f080caf7afffbfb381bdf80cc5b/packages/codemirror/test/language.spec.ts#L51

            it('should default to null', async () => {
              const spec = (await languages.getLanguage('this is not a mode'))!;
              expect(spec.name).toBe('none');
            });
          });
          
          describe('#highlight', () => {
            it('should load a defined spec', async () => {
              const container = document.createElement('pre');
              await languages.highlight(
                `(defun check-login (name password) ; absolutely secure
            (if (equal name "admin")
              (equal password "12345")
              #t))`,
                languages.findBest('text/foo'),
                container
              );
              expect(container.innerHTML).toEqual(
                `<span class="ͼ19">(</span>defun check-login <span class="ͼ19">(</span>name password<span class="ͼ19">)</span> <span class="ͼ11">; absolutely secure</span>
            <span class="ͼ19">(</span>if <span class="ͼ19">(</span>equal name <span class="ͼ12">"admin"</span><span class="ͼ19">)</span>

The span elements with weird class names are applied by the highlighter.
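As a quick check along these lines, the rendered HTML can be scanned for those span wrappers. The helper below is a hypothetical, dependency-free sketch that works on an HTML string; in the browser, the string would be `container.innerHTML` after the same `languages.highlight(...)` call used in the test, pointed at your own language.

```typescript
// Counts the <span class="..."> wrappers that the highlighter adds around tokens.
function countHighlightSpans(html: string): number {
  const matches = html.match(/<span\s+class="[^"]+">/g);
  return matches === null ? 0 : matches.length;
}

const highlightedHtml =
  '<span class="ͼ19">(</span>defun check-login ' +
  '<span class="ͼ19">(</span>name password<span class="ͼ19">)</span>';
const plainHtml = 'Hello';

console.log(countHighlightSpans(highlightedHtml)); // 3
console.log(countHighlightSpans(plainHtml)); // 0
```

A count of zero means the text was inserted without any token styling, which points at the tag mapping or the theme rather than at language detection.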

That test uses a custom language defined in

Then we produce the JavaScript file for it:

github.com

jupyterlab/jupyterlab/blob/5e4cae5d0b3b4f080caf7afffbfb381bdf80cc5b/packages/codemirror/package.json#L31


      
          },
          "files": [
            "lib/**/*.{d.ts,js,js.map}",
            "style/*.css",
            "typings/codemirror/*.d.ts",
            "style/index.js",
            "src/**/*.{ts,tsx}"
          ],
          "scripts": {
            "build": "tsc -b",
            "build:test": "lezer-generator test/foo.grammar -o test/foo.js && tsc --build tsconfig.test.json",
            "clean": "rimraf lib && rimraf tsconfig.tsbuildinfo",
            "test": "jest",
            "test:cov": "jest --collect-coverage",
            "test:debug": "node --inspect-brk ../../node_modules/.bin/jest --runInBand",
            "test:debug:watch": "node --inspect-brk ../../node_modules/.bin/jest --runInBand --watch",
            "watch": "tsc -b --watch"
          },
          "dependencies": {
            "@codemirror/autocomplete": "^6.16.0",
            "@codemirror/commands": "^6.5.0",

In your case, I’m a bit worried about the syntax you use to load highlight.ts within a snippet.

jaysun_n · June 18, 2024, 1:05pm

I’ll install jest, figure out how to use it, and try to recreate the foo test case you posted in my project.

In your case, I’m a bit worried about the syntax you use to load highlight.ts within a snippet.

What do you mean by this? Do you mean how I am adding the syntax highlighting in the .grammar given to Lezer?

fcollonval · June 18, 2024, 3:06pm

What do you mean by this? Do you mean how I am adding the syntax highlighting in the .grammar given to Lezer?

To be included in JupyterLab, your JavaScript assets are bundled with webpack under the hood. The link within the CodeMirror Lezer grammar is definitely ignored by webpack, so I’m not sure the code gets properly packaged.

jaysun_n · June 20, 2024, 1:00pm

So using your example I was able to create a Markdown parser. It compiled correctly once, but when I try to re-compile my project I get a type error complaining that two instances of the same type aren’t actually the same. I am not terribly familiar with TypeScript, but it seems to be because I am getting the LRParser from both @jupyterlab/codemirror and @lezer/lr, and I am not sure how I can convince TypeScript they are the same.

Here is the error I have been getting.

fcollonval · June 20, 2024, 7:11pm

Hey

Assuming you use jlpm, you can solve this by adding a resolutions entry to your package.json:

"resolutions": {
  "@lezer/lr": "^1.4.0"
},
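To illustrate why pinning a single version through resolutions can help: when two copies of a package are installed, TypeScript sees two separate class declarations, and classes with private members are not interchangeable even if their code is identical. The error text is not reproduced in the thread, so this is only a sketch of one likely mechanism, using hypothetical class names rather than the real LRParser.

```typescript
// Two structurally identical classes standing in for the same class loaded
// from two installed copies of one package (hypothetical names).
class ParserFromCopyA {
  private readonly origin = 'copy A';
  parse(input: string): string {
    return `${this.origin} parsed ${input.length} characters`;
  }
}

class ParserFromCopyB {
  private readonly origin = 'copy B';
  parse(input: string): string {
    return `${this.origin} parsed ${input.length} characters`;
  }
}

function runParser(parser: ParserFromCopyA): string {
  return parser.parse('Hello');
}

const fromSameCopy = runParser(new ParserFromCopyA());

// The call below works at runtime, but the type checker rejects it because
// the private member `origin` is declared separately in each class.
// @ts-expect-error Types have separate declarations of a private property.
const fromOtherCopy = runParser(new ParserFromCopyB());
```

With only one copy of the package in the dependency tree, both sides refer to the same declaration and the mismatch disappears.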

jaysun_n · June 21, 2024, 1:24am

I am using jlpm. I manually added the line you mentioned but am still getting the same error even when doing a clean build. Is there anything additional I need to do?

fcollonval · June 21, 2024, 5:18am

You can try running

jlpm dlx yarn-berry-deduplicate
jlpm

Could you share your package.json?

jaysun_n · June 21, 2024, 6:19am

I ran those commands and now am seeing the following in addition to the previous errors:

error TS2688: Cannot find type definition file for 'cookie'.
  The file is in the program because:
    Entry point for implicit type library 'cookie'

error TS2688: Cannot find type definition file for 'cors'.
  The file is in the program because:
    Entry point for implicit type library 'cors'

error TS2688: Cannot find type definition file for 'karma'.
  The file is in the program because:
    Entry point for implicit type library 'karma'

Here is my package.json.

fcollonval · June 21, 2024, 10:40am

As you are not very familiar with TypeScript, you can take the shortcut of adding

"skipLibCheck": true

to the compilerOptions in tsconfig.json.

This tells TypeScript to skip type checking of the declaration files (.d.ts) of your dependencies.

jaysun_n · June 22, 2024, 12:18pm

~~That got it compiling yesterday but I opened my project and now the project compiles but I am getting the following error. This is when I try to open the file that contains my custom syntax.~~ I had an issue in my index.ts where somewhere in my git rebasing I lost the part where I actually provide the extension.

~~The only thing different is that my laptop was restarted between yesterday and today.~~ Do you know if there are any Jupyter scripts or options I can use to find the source of this error?

Thank you so much for your help.

jaysun_n · June 22, 2024, 12:44pm

How do I use the new CodeMirror theme I created? I added one with the following code, but I am not seeing any updates to my syntax highlighting. Do I need to associate the language with the theme?

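import { JupyterFrontEnd, JupyterFrontEndPlugin } from '@jupyterlab/application';
import { IEditorThemeRegistry } from '@jupyterlab/codemirror';
import { HighlightStyle, defaultHighlightStyle, syntaxHighlighting } from '@codemirror/language';
import { EditorView } from '@codemirror/view';
import { tags as t } from '@lezer/highlight';

/**
 * Initialization data for the algorithm theme extension.
 */
export const algorithmtheme_plugin: JupyterFrontEndPlugin<void> = {
  id: 'unicodelab-ts:algorithmtheme',
  autoStart: true,
  // provides: snippetToken, // For providing services
  requires: [IEditorThemeRegistry],
  optional: [],
  activate: (
    app: JupyterFrontEnd,
    theme_registry: IEditorThemeRegistry
  ) => {
    // Register the theme; this only makes it available in the registry.
    theme_registry.addTheme({
      name: 'algorithm',
      displayName: 'Algorithm',
      theme: [
        EditorView.baseTheme({}),
        syntaxHighlighting(defaultHighlightStyle),
        // Style number tokens so the effect is easy to spot while testing.
        syntaxHighlighting(HighlightStyle.define([
          { tag: t.number,
            color: "blue",
            textDecoration: "line-through" },
        ]))
      ]
    });
  }
};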

fcollonval · June 24, 2024, 3:04pm

You should see your new theme in that menu. Click on the name to activate it:
