2020年1月3日 23:01:41go评论170阅读模式

英文:

Function Based Index not improving query performance

问题

I have created a view and a function-based index based on the provided code. The view is named "TST_AGG" and contains a case statement that processes the "ticker" column. The function-based index is named "IDX_ROOT" and is created on the "TICKER" column in the "TTT_IMP" table. If you need further assistance or have specific questions about this code, please let me know.

英文:

I have created view and in this view i have added the below case statement which i need and for which i already create exactly the function based index. The view has 1900000 records. When i tried to execute the view it takes hours to run and the performance of this view is very low. I dont understand how can i improve the performance.

CREATE OR REPLACE VIEW
    TST_AGG
    (
    ROOT) AS
    Select
     CASE
                WHEN regexp_like(ticker, &#39;\s.*\s&#39;)
                THEN SUBSTR(ticker, 1, instr(ticker, &#39; &#39;)-1)
                WHEN regexp_like(ticker, &#39;\s&#39;)
                THEN
                    CASE
                        WHEN regexp_like(SUBSTR(ticker, 1, instr(ticker, &#39; &#39;)-1), &#39;(P|C)$&#39;)
                        AND LENGTH(SUBSTR(ticker, 1, instr(ticker, &#39; &#39;)-1)) &gt;= 4
                        THEN SUBSTR(SUBSTR(ticker, 1, instr(ticker, &#39; &#39;)-1), 1, LENGTH(SUBSTR(ticker, 1
                            , instr(ticker, &#39; &#39;)-1))-3)
                        WHEN regexp_like(SUBSTR(ticker, 1, instr(ticker, &#39; &#39;)-1), &#39;\w\d\d\w\d$&#39;)
                        THEN SUBSTR(SUBSTR(ticker, 1, instr(ticker, &#39; &#39;)-1), 1, LENGTH(SUBSTR(ticker, 1
                            , instr(ticker, &#39; &#39;)-1))-5)
                        WHEN regexp_like(SUBSTR(ticker, 1, instr(ticker, &#39; &#39;)), &#39;\w\d\w\d$&#39;)
                        THEN SUBSTR(SUBSTR(ticker, 1, instr(ticker, &#39; &#39;)-1), 1, LENGTH(SUBSTR(ticker, 1
                            , instr(ticker, &#39; &#39;)-1))-4)
                        ELSE SUBSTR(ticker, 1, instr(ticker, &#39; &#39;)-1)
                    END
                WHEN regexp_like(ticker, &#39;(P|C)$&#39;)
                AND LENGTH(ticker) &gt;= 4
                THEN SUBSTR(ticker, 1, LENGTH(ticker)-3)
                WHEN regexp_like(ticker, &#39;\w\d\d\w\d$&#39;)
                THEN SUBSTR(ticker, 1, LENGTH(ticker)-5)
                WHEN regexp_like(ticker, &#39;\w\d\w\d$&#39;)
                THEN SUBSTR(ticker, 1, LENGTH(ticker)-4)
                ELSE ticker
            END ) AS ROOT
FROM TTT_IMP

Below is the functional based index i have created:

CREATE INDEX &quot;IDX_ROOT&quot; ON &quot;TTT_IMP&quot; (CASE  WHEN  REGEXP_LIKE (&quot;TICKER&quot;,&#39;\s.*\s&#39;) THEN SUBSTR(&quot;TICKER&quot;,1,INSTR(&quot;TICKER&quot;,&#39; &#39;)-1) WHEN  REGEXP_LIKE (&quot;TICKER&quot;,&#39;\s&#39;) THEN CASE  WHEN ( REGEXP_LIKE (SUBSTR(&quot;TICKER&quot;,1,INSTR(&quot;TICKER&quot;,&#39; &#39;)-1),&#39;(P|C)$&#39;) AND LENGTH(SUBSTR(&quot;TICKER&quot;,1,INSTR(&quot;TICKER&quot;,&#39; &#39;)-1))&gt;=4) THEN SUBSTR(SUBSTR(&quot;TICKER&quot;,1,INSTR(&quot;TICKER&quot;,&#39; &#39;)-1),1,LENGTH(SUBSTR(&quot;TICKER&quot;,1,INSTR(&quot;TICKER&quot;,&#39; &#39;)-1))-3) WHEN  REGEXP_LIKE (SUBSTR(&quot;TICKER&quot;,1,INSTR(&quot;TICKER&quot;,&#39; &#39;)-1),&#39;\w\d\d\w\d$&#39;) THEN SUBSTR(SUBSTR(&quot;TICKER&quot;,1,INSTR(&quot;TICKER&quot;,&#39; &#39;)-1),1,LENGTH(SUBSTR(&quot;TICKER&quot;,1,INSTR(&quot;TICKER&quot;,&#39; &#39;)-1))-5) WHEN  REGEXP_LIKE (SUBSTR(&quot;TICKER&quot;,1,INSTR(&quot;TICKER&quot;,&#39; &#39;)),&#39;\w\d\w\d$&#39;) THEN SUBSTR(SUBSTR(&quot;TICKER&quot;,1,INSTR(&quot;TICKER&quot;,&#39; &#39;)-1),1,LENGTH(SUBSTR(&quot;TICKER&quot;,1,INSTR(&quot;TICKER&quot;,&#39; &#39;)-1))-4) ELSE SUBSTR(&quot;TICKER&quot;,1,INSTR(&quot;TICKER&quot;,&#39; &#39;)-1) END  WHEN ( REGEXP_LIKE (&quot;TICKER&quot;,&#39;(P|C)$&#39;) AND LENGTH(&quot;TICKER&quot;)&gt;=4) THEN SUBSTR(&quot;TICKER&quot;,1,LENGTH(&quot;TICKER&quot;)-3) WHEN  REGEXP_LIKE (&quot;TICKER&quot;,&#39;\w\d\d\w\d$&#39;) THEN SUBSTR(&quot;TICKER&quot;,1,LENGTH(&quot;TICKER&quot;)-5) WHEN  REGEXP_LIKE (&quot;TICKER&quot;,&#39;\w\d\w\d$&#39;) THEN SUBSTR(&quot;TICKER&quot;,1,LENGTH(&quot;TICKER&quot;)-4) ELSE &quot;TICKER&quot; END );

答案1

得分: 4

索引可以在两种情况下使用。

用于减少所选记录，即

SELECT ... FROM TST_AGG where ROOT = ...
用于避免查询表格，以及避免昂贵的计算

SELECT ROOT FROM TST_AGG

我在这里假设是第二种情况。只有当优化器知道索引条目与表格是一对一的关系时，才能使用索引（代替表格）。因为它不知道该表达式是否可能返回一个NULL（这些值不存储在索引中），除非你告诉优化器，否则它无法进行直接替换。

因此

SELECT ROOT FROM TST_AGG

将无法使用索引，但

SELECT ROOT FROM TST_AGG WHERE ROOT IS NOT NULL

应该可以使用。

还要注意的一件事是，我们可能会更改用于存储索引的表达式语法。因此，请查看USER_IND_EXPRESSIONS，并可能将该表达式重新放入视图的定义中。

英文:

The index can be used in two scenarios.

To reduce the records selected, ie

SELECT ... FROM TST_AGG where ROOT = ...
To avoid querying a table, and to avoid expensive calcuations

SELECT ROOT FROM TST_AGG

I am assuming the latter here. The optimizer can only use an index (in place of a table) if it knows that the index entries are one for one with the table. Because it does not know if that expression may return a NULL (which are NOT stored in the index) it cannot make that direct swap unless you inform the optimizer.

Hence

SELECT ROOT FROM TST_AGG

will not have the chance to use the index, but

SELECT ROOT FROM TST_AGG WHERE ROOT IS NOT NULL

should be able to.

One other thing to be careful of, is that we might alter the expression syntax that we used to store the index. So check out USER_IND_EXPRESSIONS, and perhaps put that expression back into the definition of the view.

答案2

得分: 2

我建议您审查您的数据模型，正则表达式真的很难看。直接将相关信息存储在列中，而不是在“ticket”字符串的某个隐藏位置。

无论如何，我建议创建一个虚拟列而不是视图。然后，您可以在这个虚拟列上创建索引，它也应该被使用。类似于这样：

ALTER TABLE TTT_IMP ADD (ROOT VARCHAR2(20) GENERATED ALWAYS AS (
CAST(
CASE
WHEN regexp_like(ticker, '\s.*\s')
THEN SUBSTR(ticker, 1, instr(ticker, ' ')-1)
WHEN regexp_like(ticker, '\s')
THEN
CASE
WHEN regexp_like(SUBSTR(ticker, 1, instr(ticker, ' ')-1), '(P|C)$')
AND LENGTH(SUBSTR(ticker, 1, instr(ticker, ' ')-1)) >= 4
THEN SUBSTR(SUBSTR(ticker, 1, instr(ticker, ' ')-1), 1, LENGTH(SUBSTR(ticker, 1
, instr(ticker, ' ')-1))-3)
WHEN regexp_like(SUBSTR(ticker, 1, instr(ticker, ' ')-1), '\w\d\d\w\d$')
THEN SUBSTR(SUBSTR(ticker, 1, instr(ticker, ' ')-1), 1, LENGTH(SUBSTR(ticker, 1
, instr(ticker, ' ')-1))-5)
WHEN regexp_like(SUBSTR(ticker, 1, instr(ticker, ' ')), '\w\d\w\d$')
THEN SUBSTR(SUBSTR(ticker, 1, instr(ticker, ' ')-1), 1, LENGTH(SUBSTR(ticker, 1
, instr(ticker, ' ')-1))-4)
ELSE SUBSTR(ticker, 1, instr(ticker, ' ')-1)
END
WHEN regexp_like(ticker, '(P|C)$')
AND LENGTH(ticker) >= 4
THEN SUBSTR(ticker, 1, LENGTH(ticker)-3)
WHEN regexp_like(ticker, '\w\d\d\w\d$')
THEN SUBSTR(ticker, 1, LENGTH(ticker)-5)
WHEN regexp_like(ticker, '\w\d\w\d$')
THEN SUBSTR(ticker, 1, LENGTH(ticker)-4)
ELSE ticker
END
AS VARCHAR2(20))
) VIRTUAL);

英文:

I would suggest to review your data model, the regex is really ugly. Store relevant information directly in column instead of somewhere hidden in a ticket string.

Anyway, I would propose to create a virtual column instead of view. Then you can create an index on this virtual column and it should also be used. Would be similar to this:

ALTER TABLE TTT_IMP ADD (ROOT VARCHAR2(20) GENERATED ALWAYS AS (
CAST(
    CASE
    WHEN regexp_like(ticker, &#39;\s.*\s&#39;)
    THEN SUBSTR(ticker, 1, instr(ticker, &#39; &#39;)-1)
    WHEN regexp_like(ticker, &#39;\s&#39;)
    THEN
        CASE
            WHEN regexp_like(SUBSTR(ticker, 1, instr(ticker, &#39; &#39;)-1), &#39;(P|C)$&#39;)
            AND LENGTH(SUBSTR(ticker, 1, instr(ticker, &#39; &#39;)-1)) &gt;= 4
            THEN SUBSTR(SUBSTR(ticker, 1, instr(ticker, &#39; &#39;)-1), 1, LENGTH(SUBSTR(ticker, 1
                , instr(ticker, &#39; &#39;)-1))-3)
            WHEN regexp_like(SUBSTR(ticker, 1, instr(ticker, &#39; &#39;)-1), &#39;\w\d\d\w\d$&#39;)
            THEN SUBSTR(SUBSTR(ticker, 1, instr(ticker, &#39; &#39;)-1), 1, LENGTH(SUBSTR(ticker, 1
                , instr(ticker, &#39; &#39;)-1))-5)
            WHEN regexp_like(SUBSTR(ticker, 1, instr(ticker, &#39; &#39;)), &#39;\w\d\w\d$&#39;)
            THEN SUBSTR(SUBSTR(ticker, 1, instr(ticker, &#39; &#39;)-1), 1, LENGTH(SUBSTR(ticker, 1
                , instr(ticker, &#39; &#39;)-1))-4)
            ELSE SUBSTR(ticker, 1, instr(ticker, &#39; &#39;)-1)
        END
    WHEN regexp_like(ticker, &#39;(P|C)$&#39;)
    AND LENGTH(ticker) &gt;= 4
    THEN SUBSTR(ticker, 1, LENGTH(ticker)-3)
    WHEN regexp_like(ticker, &#39;\w\d\d\w\d$&#39;)
    THEN SUBSTR(ticker, 1, LENGTH(ticker)-5)
    WHEN regexp_like(ticker, &#39;\w\d\w\d$&#39;)
    THEN SUBSTR(ticker, 1, LENGTH(ticker)-4)
    ELSE ticker
    END
AS VARCHAR2(20))
) VIRTUAL);

答案3

得分: 0

Here's the translated content:

12.4.2 函数索引的缺点

如果索引表达式是函数调用，那么函数的返回类型无法约束。

因为您无法使用 NOT NULL 约束函数的返回类型，您必须确保使用该索引的查询不会获取 NULL 值。否则，数据库将执行全表扫描。

英文:

To add to @ConnorMcDonnald’s great answer, here’s a blurb from the docs:

12.4.2 Disadvantages of Function-Based Indexes

> If the index expression is a function invocation, then the function
> return type cannot be constrained.
>
> Because you cannot constrain the function return type with NOT NULL,
> you must ensure that the query that uses the index cannot fetch NULL
> values. Otherwise, the database performs a full table scan.

通过集体智慧和协作来改善编程学习和解决问题的方式。致力于成为全球开发者共同参与的知识库，让每个人都能够通过互相帮助和分享经验来进步。

基于函数的索引未能提高查询性能。

问题

答案1

答案2

答案3

如何在MySQL中转换月份

将SAS日期值（mmddyy10.）插入到Oracle表中。

SQLAlchemy 2.x：立即加载连接的集合查询

更改数据库会话上下文将随池化连接重用而持续存在。

What's the correct way to type hint an empty list as a literal in python?

如何在Highcharts Gantt中更改本地化的星期名称

如何在同一个流中使用多个过滤器和映射函数？

如何使用Map/Set来将代码优化到O(n)？

.NET MAUI Android在GitHub Actions上构建失败，错误代码为1。

如何在Playwright视觉比较中屏蔽多个定位器？

在C++中，可以使用可变模板参数来检索类型的内部类型。

selenium.common.exceptions.StaleElementReferenceException: Message: stale element reference: stale element not found

Creating and opening a URL to log in to Website via Basic Auth with Robot Framework/Selenium (Python)

AG Grid 在上下文菜单中以大文本形式打开

发表评论