# How to increase the Dali result limit
# Question
In HPCC ECL, when trying to increase the data size limit from 10 MB to approximately 5 GB using the option command (`#OPTION('outputlimitMb',5048)`), I got the following error.

```
Dali result outputs are restricted to an absolute maximum of 2000 MB
(5048 MB specified by option). A huge dali result usually indicates the ECL needs altering.
```

*How to reproduce the issue?*

First, install the HPCC GNN bundle and then run its [classificationTest.ecl][1]. After that, increase the training data size from 1000 to 10,000,000:
```ecl
IMPORT STD;
IMPORT Python3 AS Python;
IMPORT $.^ AS GNN;
IMPORT GNN.Tensor;
IMPORT GNN.Internal.Types AS iTypes;
IMPORT GNN.Types;
IMPORT GNN.GNNI;
IMPORT GNN.Internal AS Int;
IMPORT ML_Core AS mlc;
#OPTION('outputlimitMb',5048);
NumericField := mlc.Types.NumericField;
// Prepare training data
RAND_MAX := POWER(2,32) -1;
// Test parameters
trainCount := 100000000;
testCount := 100;
featureCount := 5;
classCount := 3;
numEpochs := 5;
batchSize := 128;
// End of Test Parameters
// Prepare training data.
// We use 5 inputs (X) and a one hot encoded output (Y) with 3 classes
// (i.e. 3 outputs).
trainRec := RECORD
UNSIGNED8 id;
SET OF REAL4 x;
SET OF REAL4 y;
END;
// The target function maps a set of X features into a Y value,
// which is a threshold on a polynomial function of X.
// Note that we are effectively doing a One Hot encoding here, since we
// return a set of Y values, one for each class, with only one value
// being one and the rest zero.
// If we were working with tensors here, we could have used a class
// label and then called Utils.ToOneHot to encode it.
SET OF REAL4 targetFunc(REAL4 x1, REAL4 x2, REAL4 x3, REAL4 x4, REAL4 x5) := FUNCTION
rslt0 := TANH(.5 * POWER(x1, 4) - .4 * POWER(x2, 3) + .3 * POWER(x3,2) - .2 * x4 + .1 * x5);
rslt := MAP(rslt0 > -.25 => [1,0,0], rslt0 < .25 => [0,1,0], [0,0,1]);
RETURN rslt;
END;
// Build the training data
train0 := DATASET(trainCount, TRANSFORM(trainRec,
SELF.id := COUNTER,
SELF.x := [(RANDOM() % RAND_MAX) / (RAND_MAX / 2) - 1,
(RANDOM() % RAND_MAX) / (RAND_MAX / 2) - 1,
(RANDOM() % RAND_MAX) / (RAND_MAX / 2) - 1,
(RANDOM() % RAND_MAX) / (RAND_MAX / 2) - 1,
(RANDOM() % RAND_MAX) / (RAND_MAX / 2) - 1],
SELF.y := [])
);
// Be sure to compute Y in a second step. Otherwise, the RANDOM() will be executed twice and the Y will be based
// on different values than those assigned to X. This is an ECL quirk that is not easy to fix.
train := PROJECT(train0, TRANSFORM(RECORDOF(LEFT), SELF.y := targetFunc(LEFT.x[1], LEFT.x[2], LEFT.x[3], LEFT.x[4], LEFT.x[5]), SELF := LEFT));
OUTPUT(train, NAMED('trainData'));
// Build the test data. Same process as the training data.
test0 := DATASET(testCount, TRANSFORM(trainRec,
SELF.id := COUNTER,
SELF.x := [(RANDOM() % RAND_MAX) / RAND_MAX -.5,
(RANDOM() % RAND_MAX) / RAND_MAX -.5,
(RANDOM() % RAND_MAX) / RAND_MAX -.5,
(RANDOM() % RAND_MAX) / RAND_MAX -.5,
(RANDOM() % RAND_MAX) / RAND_MAX -.5],
SELF.y := [])
);
test := PROJECT(test0, TRANSFORM(RECORDOF(LEFT), SELF.y := targetFunc(LEFT.x[1], LEFT.x[2], LEFT.x[3], LEFT.x[4], LEFT.x[5]), SELF := LEFT));
// Break the training and test data into X (independent) and Y (dependent) data sets.
// Format as NumericField data.
trainX := NORMALIZE(train, featureCount, TRANSFORM(NumericField,
SELF.wi := 1,
SELF.id := LEFT.id,
SELF.number := COUNTER,
SELF.value := LEFT.x[COUNTER]));
trainY := NORMALIZE(train, classCount, TRANSFORM(NumericField,
SELF.wi := 1,
SELF.id := LEFT.id,
SELF.number := COUNTER,
SELF.value := LEFT.y[COUNTER]));
OUTPUT(trainX, NAMED('X1'));
OUTPUT(trainY, NAMED('y1'));
testX := NORMALIZE(test, featureCount, TRANSFORM(NumericField,
SELF.wi := 1,
SELF.id := LEFT.id,
SELF.number := COUNTER,
SELF.value := LEFT.x[COUNTER]));
testY := NORMALIZE(test, classCount, TRANSFORM(NumericField,
SELF.wi := 1,
SELF.id := LEFT.id,
SELF.number := COUNTER,
SELF.value := LEFT.y[COUNTER]));
// ldef provides the set of Keras layers that form the neural network. These are
// provided as strings representing the Python layer definitions as would be provided
// to Keras. Note that the symbol 'tf' is available for use (import tensorflow as tf), as is
// the symbol 'layers' (from tensorflow.keras import layers).
ldef := ['''layers.Dense(16, activation='tanh', input_shape=(5,))''',
'''layers.Dense(16, activation='relu')''',
'''layers.Dense(3, activation='softmax')'''];
// compileDef defines the compile line to use for compiling the defined model.
// Note that 'model.' is implied, and should not be included in the compile line.
compileDef := '''compile(optimizer=tf.keras.optimizers.SGD(.05),
loss=tf.keras.losses.categorical_crossentropy,
metrics=['accuracy'])
''';
// Note that the order of the GNNI functions is maintained by passing tokens returned from one call
// into the next call that is dependent on it.
// For example, s is returned from GetSession(). It is used as the input to DefineModels(...) so
// that DefineModels() cannot execute until GetSession() has completed.
// Likewise, mod, the output from DefineModel(), is provided as input to Fit(). Fit in turn returns
// a token that is used by GetLoss(), EvaluateMod(), and Predict(), which are only dependent on Fit()
// having completed, and are not order dependent on one another.
// GetSession must be called before any other functions
s := GNNI.GetSession();
// Define model is dependent on the Session
// ldef contains the Python definition for each Keras layer
// compileDef contains the Keras compile statement.
mod := GNNI.DefineModel(s, ldef, compileDef);
// GetWeights returns the initialized weights that have been synchronized across all nodes.
wts := GNNI.GetWeights(mod);
OUTPUT(wts, NAMED('InitWeights'));
// Fit trains the models, given training X and Y data. BatchSize is not the Keras batchSize,
// but defines how many records are processed on each node before synchronizing the weights
// Note that we use the NF form of Fit since we are using NumericField for I/O.
mod2 := GNNI.FitNF(mod, trainX, trainY, batchSize := batchSize, numEpochs := numEpochs);
OUTPUT(mod2, NAMED('mod2'));
// GetLoss returns the average loss for the final training epoch
losses := GNNI.GetLoss(mod2);
// EvaluateNF computes the loss, as well as any other metrics that were defined in the Keras
// compile line. This is the NumericField form of EvaluateMod.
metrics := GNNI.EvaluateNF(mod2, testX, testY);
OUTPUT(metrics, NAMED('metrics'));
// PredictNF computes the neural network output given a set of inputs.
// This is the NumericField form of Predict. Note that these predictions are
// effectively the probabilities for each class (as output from softmax in the
// final NN layer). If we had used Tensors rather than NumericField, we
// could convert to a class label by using Utils.FromOneHot, or
// Utils.Probabilities2Class.
preds := GNNI.PredictNF(mod2, testX);
OUTPUT(testY, ALL, NAMED('testDat'));
OUTPUT(preds, NAMED('predictions'));
```
Are there options or a way to fix this? I couldn't find anything mentioning Dali in the [documentation][2].
[1]: https://github.com/hpcc-systems/GNN/blob/master/Test/ClassificationTest.ecl
[2]: https://hpccsystems.com/wp-content/uploads/2022/_documents/ECLR_EN_US/OPTION_Available_Options.html
# Answer 1

**Score**: 1
The error message itself indicates there is a hard limit of 2000 MB, so your only option is to submit a JIRA ticket (track.hpccsystems.com) asking for an increase to that limit. As a workaround, you can try changing your #OPTION value to 2000 and reducing your trainCount to 1 million to see whether that works for your project. A minimal sketch of that change is below.
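This sketch assumes the rest of ClassificationTest.ecl stays unchanged; the 2000 MB value and the 1 million trainCount come from the answer above, while the logical filename in the commented alternative is a hypothetical example, not part of the original code:

```ecl
// Stay at the 2000 MB absolute maximum that the error message reports.
#OPTION('outputlimitMb', 2000);

// Reduce the training data size as suggested in the answer.
trainCount := 1000000;

// Alternative (not from the answer): if trainData does not need to be a
// workunit result, write it to a logical file instead of a named OUTPUT,
// so the rows are never stored as a Dali result. The filename below is
// a hypothetical example.
// OUTPUT(train, , '~demo::gnn::traindata', OVERWRITE);
```

As I understand it, named OUTPUTs become workunit results stored in Dali, which is why the limit applies to them; writing to a logical file keeps the data on the cluster instead.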